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1 

PENES INVOLVED IN PLANT FIBRE DEVELOPMENT 

FIELD OF THE INVENTION 

The present invention relates to polypeptides, and polynucleotides encoding 
therefor, involved in the regulation of fibre initiation and/or elongation in fibre 
5 producing plants. In particular, the present invention provides methods of altering fibre 
initiation in cotton and products thereof. The invention also relates to the use of these 
polypeptides and polynucleotides as markers of fibre production in plants including 
cotton. 

10 BACKGROUND OF TH E INVENTION 

Cotton {Gossypium hirsutum, and to a lesser extent Gossypium barbadense 
together with other Gossypium species) provides about 55% of the fibre used in textile 
manufacturing globally and is an important contributor to world economies. The cotton 
fibre is probably the most elongated cell in the plant kingdom. The molecular 
15 mechanisms that control the differentiation of this elongated plant cell are still largely 
unknown. Although commonly called fibres, these cells are not part of the vascular 
tissue and arise, instead, from the ovule epidermis. Fibres of cotton are extremely long 
single elongated epidermal cells that develop on the outer surface of cotton ovules, 
reaching upwards of 5 centimetres in some species. Fibre initiation starts between a day 
20 before and up to a day after anthesis and the fibre initials begin to elongate immediately 
after fertilisation, ballooning out from the surface of the seed coat epidermis. After a 
period of elongation, secondary cell wall thickening fills the fibre with cellulose and the 
fibre dies and collapses to form the mature fibre that is harvested from the seeds. 

In contrast to the discovery of numerous genes responsible for fibre elongation 
25 and secondary cell wall synthesis, few genes have been identified that are associated 
with fibre initiation. Early cytological studies showed structural changes in fibre initials 
occur up to three days before anthesis: including enlarged nucleoli and nuclei, as well as 
an increased number of Golgi complexes (Berlin, 1986). It has been proposed that the 
Arabidopsis leaf trichomes, which require at least twenty genes for normal development 
30 (Hulskamp et al., 1994), could serve as a model for elucidating the genetic mechanisms 
controlling cotton fibre initiation and differentiation. 

One of the first genes to be characterised in controlling leaf trichomes, 
GLABROUS 1 (GL1), encodes a member of the Myb family of transcription factors 
(Oppenheimer et al., 1991). An exhaustive search of a cotton ovule cDNA library 
35 recovered six novel Myb-domain genes, but none of them encoded a GL1 homolog 
(Loguercio et al., 1999). Arabidopsis trichome initiation is proposed to be controlled by 
a trichome promoting complex comprised of GL1, TRANSPARENT TESTA 
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GLABRA 1 (TTG1, a WD40 protein), and GLABRA3 (GL3, a basic Helix-Loop-Helix 
protein). GLABRA2 (GL2, a Homeodomain protein) regulates trichome morphology 
and spacing and TRIPTYCHON (TRY, a Myb-like protein) mediates lateral inhibition 
of trichome development in cells adjacent to each trichome (Rerie et al., 1994; Walker 
et al., 1999; Szymanski et al., 2000; Schellmann et al., 2002; Ohashi et al., 2002). 
However, genes with similar functions in cotton have yet to be identified, and hence it 
remains speculative whether these two single celled epidermal hair systems share any 
common features. 

There is a need for the identification and characterization of genes involved in 
fibre initiation in fibre producing plants such as cotton.. This will enable markers to be 
used to screen plants for desirable fibre traits, as well as allow for the production of 
transgenic plants with altered fibre production. 



SUMMARY OT? THE IN VENTION 

15 To identify genes that may be specific to fibre initiation, the present inventors 

have used mRNA from early stage fertilised ovules of wild type and 5 lintless mutants 
of cotton (that produce little if any fibres) to probe a cotton ovule cDNA microarray 
containing 10,000 cDNAs expressed around the time of fibre cell differentiation. Since 
pollination may already have occurred and zygote development initiated at this stage, 

20 the inventors used a separate microarray comparison between the mRNAs of the outer 
integument and those of the inner ovule tissues of the wild type cotton to filter out those 
genes that are not expressed specifically in the seed coat outer integument where the 
fibres are initiated. Using this strategy genes have been identified that are differentially 
expressed in the lintless mutants, and hence play a role in fibre initiation. 
•25 As a result, in one aspect the present invention provides a method of altering 

fibre initiation and/or elongation in a fibre producing plant comprising manipulating 
said plant such that the production of a polypeptide is modified when compared to a 
wild-type plant, wherein the polypeptide comprises a sequence selected from the group 
consisting of: 

30 i) an amino acid sequence provided as any one of SEQ ID NO's: 1 to 1 6; or 

ii) an amino acid sequence which is at least 50% identical to any one of SEQ ID 
NO's:l to 16. 

Preferably, the polypeptide comprises an amino acid sequence which is at least 
60%, more preferably at least 65%, more preferably at least 70%, more preferably at 
35 least 75%, more preferably at least 76%, more preferably at least 80%, more preferably 
at least 85%, more preferably at least 90%, more preferably at least 91%, more 
preferably at least 92%, more preferably at least 93%, more preferably at least 94%, 
more preferably at least 95%, more preferably at least 96%, more preferably at least 
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97%, more preferably at least 98%, more preferably at least 99%, more preferably at 
least 99.1%, more preferably at least 99.2%, more preferably at least 99.3%, more 
preferably at least 99.4%, more preferably at least 99.5%, more preferably at least 
99.6%, more preferably at least 99.7%, more preferably at least 99.8%, and even more 
5 preferably at least 99.9% identical to any one of SEQ ID NO's: 1 to 1 6. 

In one embodiment, the polypeptide comprises a sequence selected from the 

group consisting of: 

i) an amino acid sequence provided as SEQ ID NO: 1 ; or 

ii) an amino acid sequence which is at least 80% identical to SEQ ID NO: 1 . 

10 In another embodiment, the polypeptide comprises a sequence selected from the 

group consisting of: 

i) an amino acid sequence provided as SEQ ID NO:2; or 

ii) an amino acid sequence which is at least 80% identical to SEQ ID NO:2. 

In a further embodiment, the polypeptide comprises a sequence selected from the 

15 group consisting of: 

i) an amino acid sequence provided as SEQ ID NO:3 ; or 

ii) an amino acid sequence which is at least 80% identical to SEQ ID NO:3. 

In another embodiment, the method comprises recombinantly expressing the 

polypeptide in said plant. 

20 In an alternate embodiment, the method comprises reducing the level of the 

polypeptide endogenously produced by the plant. This can be achieved by any means 
known in the art. One example is by exposing the plant to an antisense polynucleotide 
or a catalytic polynucleotide which hybridizes to an mRNA molecule encoding the 
polypeptide. Another example is by exposing the plant to a dsRNA molecule that 

25 specifically down-regulates mRNA levels in a cell of an mRNA molecule encoding the 
polypeptide. 

In a further embodiment, the plant is a horticultural plant. 

In a particularly preferred embodiment, the plant is a species of the Genus 
Gossypium. 

30 In a second aspect, the present invention provides a method of assessing the 

potential of a fibre producing plant to produce fibre, the method comprising analysing 
the plant for a genetic variation in a polynucleotide associated with fibre initiation 
and/or elongation, wherein the polynucleotide comprises a sequence selected from the 
group consisting of: 

35 i) a nucleotide sequence provided as any one of SEQ ID NO's:17 to 45; or 

ii) a nucleotide sequence which is at least 50% identical to any one of SEQ ID 
NO's:17to45. 
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Preferably, the polynucleotide comprises a nucleotide sequence which is at least 
60%, more preferably at least 65%, more preferably at least 70%, more preferably at 
least 75%, more preferably at least 76%, more preferably at least 80%, more preferably 
at least 85%, more preferably at least 90%, more preferably at least 91%, more 

5 preferably at least 92%, more preferably at least 93%, more preferably at least 94%, 
more preferably at least 95%, more preferably at least 96%, more preferably at least 
97%, more preferably at least 98%, more preferably at least 99%, more preferably at 
least 99.1%, more preferably at least 99.2%, more preferably at least 99.3%, more 
preferably at least. 99.4%, more preferably at least 99.5%, more preferably at least 

10 99.6%, more preferably at least 99.7%, more preferably at least 99.8%, and even more 
preferably at least 99.9% identical to any one of SEQ ID NO's: 17 to 45. 

As the skilled addressee would be aware, the genetic variation associated with 
fibre initiation and/or elongation may be in the coding portion of a polynucleotide of the 
the invention, or may be genetically linked to be useful as a marker for fibre initiation 

15 and/or elongation. 

In one embodiment, the method comprises performing an amplification reaction 
on nucleic acids obtained from said plant, or nucleic acids synthesized using nucleic 
acids from said plant as a template, wherein the production of an amplicon in said 
amplification reaction indicates an association with fibre producing potential. 

20 In another embodiment, the method comprises performing an amplification 

reaction on nucleic acids obtained from said plant, or nucleic acids synthesized using 
nucleic acids from said plant as a template, wherein the lack of production of an 
amplicon in said amplification reaction indicates an association with fibre producing 
potential. 

25 In a further embodiment, the method comprises performing a hybridization 

reaction on nucleic acids obtained from said plant, or nucleic acids synthesized using 
nucleic acids from said plant as a template, wherein a detectable signal produced by the 
hybridization reaction indicates reduced fibre producing potential. 

In yet another embodiment, the method comprises performing a hybridization 
30 reaction on nucleic acids obtained from said plant, or nucleic acids synthesized using 
nucleic acids from said plant as a template, wherein the lack of a detectable signal by 
the hybridization reaction indicates reduced fibre producing potential. 

In an alternate embodiment, the polynucleotide is mRNA and the method 
comprises determining the levels of mRNA of the polynucleotide in the plant ovule at, 

35 or around, anthesis. 

In a third aspect, the present invention provides a method of assessing the 
potential of a fibre producing plant to produce fibre, the method comprising analysing 
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the plant for polypeptide involved in fibre initiation and/or elongation, wherein the 
polypeptide comprises a sequence selected from the group consisting of: 

i) an amino acid sequence provided as any one of SEQ ID NO*s:l to 16; or 

ii) an amino acid sequence which is at least 50% identical to any one of SEQ ID 
5 • NO's:l to 16. 

Preferably, the method comprises determining the levels of the polypeptide in the 
plant ovule at, or around, anthesis. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 
\0 i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:l, 

ii) a polypeptide comprising an amino acid sequence which is at least 87% 
identical to SEQ ID NO: 1 , and 

iii) a biologically active fragment of i) or ii), 

1 5 wherein the polypeptide regulates fibre initiation and/or elongation. 

Preferably, the polypeptide comprises an amino acid sequence which is at least 
95% identical to SEQ ID NO: 1. 

In a further aspect, the present invention provides a substantially purified . 
polypeptide selected from the group consisting of: 
20 i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO: 2, and 

ii) a biologically active fragment of i), 
wherein the polypeptide regulates fibre initiation and/or elongation. 

In yet another aspect, the present invention provides a substantially purified 
25 polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:3, 

ii) a polypeptide comprising an amino acid sequence which is at least 54% 
identical to SEQ ID NO:3, and 

30 iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

35 NO:4, 

ii) a polypeptide comprising an amino acid sequence which is at least 55% 
identical to SEQ ID NO:4, and 

iii) a biologically active fragment of i) or ii), 
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wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

5 NO:5, 

ii) a polypeptide comprising an amino acid sequence which is at least 50% 
identical to SEQ ID NO:5, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 
10 In another aspect, the present invention provides a substantially purified 

polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:6, 

ii) a polypeptide comprising an amino acid sequence which is at least 50% 
1 5 identical to SEQ ID NO:6, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 
20 i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO: 7, 

ii) a polypeptide comprising an amino acid sequence which is at least 79% 
identical to SEQ ID NO:7, and 

iii) a biologically active fragment of i) or ii), 

25 wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:8, 

30 ii) a polypeptide comprising an amino acid sequence which is at least 66% 

identical to SEQ ID NO:8, and 

iii) a biologically active fragment of i) or ii), 
wherein the polypeptide regulates fibre initiation and/or elongation. % _ 

In another aspect, the present invention provides a substantially purified 
35 polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:9, 
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ii) a polypeptide comprising an amino acid sequence which is at least 95% 
identical to SEQ ID NO:9, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 
5 In another aspect, the present invention provides s substantially purified 

polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO: 10, 

ii) a polypeptide comprising an amino acid sequence which is at least 67% 
1 0 identical to SEQ ID NO: 1 0, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 
15 i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:ll, 

ii) a polypeptide comprising an amino acid sequence which is at least 55% 
identical to SEQ ID NO:l 1, and 

iii) a biologically active fragment of i) or ii), 

20 wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:12, 

25 ii) a polypeptide comprising an amino acid sequence which is at least 59% 

identical to SEQ ID NO: 1 2, and 

iii) a biologically active fragment of i) or ii), 
wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
30 polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO: 13, 

ii) a polypeptide comprising an amino acid sequence which is at least 77% 
identical to SEQ ID NO: 1 3, and 

35 iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 
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i) a polypeptide comprising an amino acid sequence as provided in SEQ ID- 
NO: 14, 

ii) a polypeptide comprising an amino acid sequence which is at least 50% 
identical to SEQ ID NO: 14, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO: 15, . 

ii) a polypeptide comprising an amino acid sequence which is at least 64% 
identical to SEQ ID NO: 1 5, and 

iii) . a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:16, 

ii) a polypeptide comprising an amino acid sequence which is at least 50% 
identical to SEQ ID NO: 16, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

With regard to the polypeptide aspects, it will be appreciated that % identity 
figures higher than those provided above will encompass preferred embodiments. Thus, 
where applicable, in light of the minimum % identity figures, it is preferred that the 
polypeptide comprises an amino acid sequence which is at least 60%, more preferably at 
least 65%, more preferably at least 70%, more preferably at least 75%, more preferably 
at least 76%, more preferably at least 80%, more preferably at least 85%, more 
preferably at least 90%, more preferably at least 91%, more preferably at least 92%, 
more preferably at least 93%, more preferably at least 94%, more preferably at least 
95%, more preferably at least 96%, more preferably at least 97%, more preferably at 
least 98%, more preferably at least 99%, more preferably at least 99.1%,- more 
preferably at least 99.2%, more preferably at least 99.3%, more, preferably at least 
99.4%, more preferably at least 99.5%, more preferably at least 99.6%, more preferably 
at least 99.7%, more preferably at least 99.8%, and even more preferably at least 99.9% 
identical to the relevant nominated SEQ ID NO. 

Preferably, the . polypeptide can be purified from a species of the Genus 

Gossypium. 
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Preferably, the polypeptide is a fusion protein further comprising at least one 

other polypeptide sequence. 

In a preferred embodiment, the at least one other polypeptide is selected from the 
group consisting of: a polypeptide that enhances the stability of a polypeptide of the 
5 present invention, a polypeptide that assists in the purification of the fusion protein, and 
a polypeptide which assists in the polypeptide of the invention being secreted from a 

cell (particularly a plant cell). 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 
1 0 i) a sequence of nucleotides as provided in SEQ ID NO: 17; 

ii) a sequence of nucleotides as provided in SEQ ID NO: 18; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 87% identical to SEQ ID NO: 17 or 
SEQ ID NO: 18; and 

15 v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:46. 

In another aspect, the present invention provides an isolated polynucleotide 
20 comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO: 19, 

ii) a sequence of nucleotides as provided in SEQ ID NO:20, 

iii) a sequence encoding a polypeptide of the invention, and 

iv) a -sequence complementary to any one of i) to iii). 

25 In another aspect, the present invention provides an isolated polynucleotide 

comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:21; 

ii) a sequence of nucleotides as provided in SEQ ID NO:22; 

iii) a sequence encoding a polypeptide of the invention; 

30 iv) a sequence of nucleotides which is at least 54% identical to SEQ ID NO:21 or 

SEQ ID NO:22; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 

35 provided in SEQ ID NO:47. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 
i) a sequence of nucleotides as provided in SEQ ID NO:23 ; 
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ii) a sequence encoding a polypeptide of the invention; 

iii) a sequence of nucleotides which is at least 55% identical to SEQ ID NO:23; 

and 

iv) a sequence which hybridizes to any one of i) to iii) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:48. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:24; 

ii) a sequence of nucleotides as provided in SEQ ID NO:25; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:24 or 
SEQ ID NO:25; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:49. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:26; 

ii) a sequence of nucleotides as provided in SEQ ID NO:27; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:26 or 
SEQ ID NO:27; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:50. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:28, 

ii) a sequence of nucleotides as provided in SEQ ID NO:29, 

iii) a sequence encoding a polypeptide of the invention, 

iv) a sequence complementary to any one of i) to iii). 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 
i) a sequence of nucleotides as provided in SEQ ID NO:30; 
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ii) a sequence of nucleotides as provided in SEQ ID NO:31; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 65% identical to SEQ ID NO:30 or 

SEQ IDNO:31; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:5 1 . 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:32, 

ii) a sequence of nucleotides as provided in SEQ ID NO:33, 

iii) a sequence encoding a polypeptide of the invention, 

iv) a sequence complementary to any one of i) to iii). 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:34; 

ii) a sequence of nucleotides as provided in SEQ ID NO: 3 5; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 70% identical to SEQ ID NO:34 or 

SEQIDNO:35; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:52. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:36; 

ii) a sequence of nucleotides as provided in SEQ ID NO:37; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 55% identical to SEQ ID NO:36 or 

SEQ ID NO:37; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:53. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 



_„-„:j„.i u„ p iftDTO from the IFW image Database on 04/19/20iK> 



12 



i) a sequence of nucleotides as provided in SEQ ID NO:3S; 

ii) a sequence encoding a polypeptide of the invention, 

iii) a sequence of nucleotides which is at least 65% identical to SEQ ID NO:38; 

and 

iv) a sequence which hybridizes to any one of i) to iii) under high stringency 
conditions 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:39; 

ii) a sequence of nucleotides as provided in SEQ ID NO:40; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 95% identical to SEQ ID NO:39 or 
SEQ ID NO:40; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:54. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:41; 

ii) a sequence of nucleotides as provided in SEQ ID NO:42; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:41 or 
SEQ ID NO:42; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:55. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:43; 

ii) a sequence of nucleotides as provided in SEQ ID NO:44; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 65% identical to SEQ ID NO:43 or 
SEQ ID NO :44; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 
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wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:56. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 
5 i) a sequence of nucleotides as provided in SEQ ID NO:45; 

ii) a sequence encoding a polypeptide of the invention; 

iii) a sequence .of nucleotides which is at least 50% identical to SEQ ID NO:45; 

and 

iv) a sequence which hybridizes to any one of i) to iii) under high stringency 
10 conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:57. 

With regard to the polynucleotide aspects, it will be appreciated that % identity 
figures higher than those provided above will encompass preferred embodiments. Thus, 
15 where applicable, in light of the minimum % identity figures, it is preferred that the 
polynucleotide comprises a nucleotide sequence which is at least 60%, more preferably 
at least 65%, more preferably at least 70%, more preferably at least 75%, more 
preferably at least 76%, more preferably at least 80%, more preferably at least 85%, 
more preferably at least 90%, more preferably at least 91%, more preferably at least 
20 92%, more preferably at least 93%, more preferably at least 94%, more preferably at 
least' 95%, more preferably at least 96%, more preferably at least 97%, more preferably 
at least 98%, more preferably at least 99%, more preferably at least 99.1%, more 
preferably at least 99.2%, more preferably at least 99.3%, more preferably at least 
99.4%, more preferably at least 99.5%, more preferably at least 99.6%, more preferably 
25 at least 99.7%, more preferably at least 99.8%, and even more preferably at least 99.9% 
identical to the relevant nominated SEQ ID NO. 

In a further aspect, the present invention provides a catalytic polynucleotide 
capable of cleaving a polynucleotide according to the invention. 
Preferably, the catalytic polynucleotide is a ribo2yme. 
30 In yet another aspect, the present invention provides an oligonucleotide which 

comprises at least 19 contiguous nucleotides of a polynucleotide according to the 
invention. 

In another aspect, the present invention provides a double stranded RNA 
(dsRNA) molecule comprising an oligonucleotide according. to the invention, wherein 
35 the portion of the molecule that is double stranded is at least 19 basepairs in length and 
comprises said oligonucleotide. 

Preferably, the dsRNA is expressed from a single promoter, wherein the strands 
of the double stranded portion are linked by a single stranded portion. 



Pnnv oroviHAri hv I IRPTO from the IFW Image Database on 04/19/2005 



14 



In a* further aspect, the present invention provides a vector comprising or 
encoding the polynucleotide according to the invention. 

The vectors may be, for example, a plasmid, virus, transposon or phage vector 
provided with an origin of replication, and preferably a promotor for the expression of 
5 the polynucleotide and optionally a regulator of Ihe promotor. The vector may contain 
one or more selectable markers, for example an ampicillin resistance gene in the case of 
a bacterial plasmid or a neomycin resistance gene for a mammalian expression vector. 
The vector may be used in vitro, for example for the production of RNA or used to 
transfect or transform a host cell. Preferably, the vector is capable of replication in a 
10 plant cell. 

Preferably, the polynucleotide is operably linked to a plant ovule or fibre specific 
promoter. 

In another aspect, the present invention provides a vector comprising or encoding 
oligonucleotide of the invention or the dsRNA molecule of the invention. 
15 i n a further aspect, the present invention provides a host cell comprising a vector 

according to the invention. 

In a further aspect, the present invention provides a transgenic plant, the plant 
having been transformed with polynucleotide according to the invention or an 
oligonucleotide of the invention. 
20 In one embodiment, the polynucleotide is capable of expression to produce a 

polypeptide according to the invention. 

In an alternate embodiment, the plant has been transformed such that it produces 
a catalytic polynucleotide of the invention, or a dsRNA molecule of the invention. 

In a further embodiment, the polynucleotide, catalytic polynucleotide or dsRNA 
25 down-regulates the production of a polypeptide of the invention which is endogenously 

produced by the plant. 

In a further aspect, the present invention provides a substantially purified 
antibody, or fragment thereof, that specifically binds a polypeptide of the invention. 

In another aspect, the present invention provides a method of breeding a fibre 
30 producing plant, the method comprising performing a method according to the first, 
second or third aspects of the invention. 

In a further aspect, the present invention provides a method of selecting from a 
breeding population a fibre producing plant with altered fibre initiation and/or 
elongation potential, the method comprising; 
35 i) crossing two plants which have differing potential to produce fibre, 

ii) performing a method according to the first, second or third aspects of the 
invention on progeny plants, 



Coov provided bv USPTO from the IFW Image Database on 04/19/2005 



15 



iii) selecting a progeny plant with altered fibre initiation and/or elongation 
potential when compared to a parent plant. 

In another aspect, the present invention provides a plant produced by a method of 

the invention. 

5 In a further aspect, the present invention provides a seed of a plant, transgenic or 

otherwise, of the invention. 

In a further aspect, the present invention provides fibre of a plant, transgenic or 

otherwise, of the invention. 

In another aspect, the present invention provides a method of identifying an 
10 agent which alters fibre initiation and/or elongation of a fibre producing plant, the 
method comprising 

a) exposing a polypeptide which is at least 50% identical to any one of SEQ ID 
NO's:l to 16 to a candidate agent, and 

b) assessing the ability of the candidate agent to modulate the activity of the 

15 polypeptide. 

In a further aspect, the present invention provides a method of identifying an 
agent which alters fibre initiation and/or elongation of a fibre producing plant, the 
method comprising 

a) exposing a polypeptide which is at least 50% identical to any one of SEQ ID 
20 NO'srl to 16 to a binding partner which binds the polypeptide, and a candidate agent, 

and 

b) assessing the ability of the candidate agent to compete with the binding 
partner for binding to the polypeptide. 

Preferably, the binding partner is detectably labeled. 
25 in a further aspect, the present invention provides a method of identifying an 

agent which alters fibre initiation and/or elongation of a fibre producing plant, the 
method comprising 

a) exposing a polynucleotide encoding a polypeptide which is at least 50% 
identical to any one of SEQ ID NO's:l to 16 to a candidate agent under conditions 

30 which allow expression of the polynucleotide, and 

b) assessing the ability of the candidate agent to modulate levels of polypeptide 
produced by the polynucleotide. 

In another aspect, the present invention provides a method of identifying an 
agent which alters fibre initiation and/or elongation of a fibre producing plant, the 

35 method comprising 

a) exposing a polynucleotide which is at least 50% identical to any one of SEQ 

ID NO's:17 to 45 to a candidate agent, and 
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b) assessing the ability of the candidate agent to hybridize and/or cleave the 
polynucleotide. 

As will be apparent, preferred features and characteristics of one aspect of the 
invention are applicable to many other aspects of the invention. 

Throughout this specification the word "comprise", or variations such as 
"comprises" or "comprising", will be understood to imply the inclusion of a stated 
element, integer or step, or group of elements, integers or steps, but not the exclusion of 
any other element, integer or step, or group of elements, integers or steps. 

The invention is hereinafter described by way of the following non-limiting 
Examples and with reference to the accompanying figures. 

ttRTEF DESCRIPTION OF THE ACC OMPANYING DRAWINGS 
Figure 1. Expression profiles of the candidate genes measured by microarrays. The 
plots on the left side (column A) represent the results from the DPI 6 time course 
experiment and the values plotted are the ratios relative to 0 dpa. The plots on the right 
side (column B) represent the results of multi-time point Lintless 4 A/DP 16 
comparisons and the values plotted are the ratios of Lintless 4A/DP16 at the 
corresponding dpa. The genes showing similar expression profiles are plotted together. 
Figure 2. RT PCR of GhMyb25 with (3-tubulin as control. 

a. Different tissues from DP 16. -Co: negative control, reaction without reverse 
transcriptase; O: 0 dpa ovule; P: petal; L: leaf; R: root; S: stem. 

b. Ovule and fibre from DPI 6. The numbers indicate the corresponding dpa. 

c. Ovule from 3 mutants. The numbers indicated the corresponding dpa. 

d. Southern blotting of the c. hybridised with an ON035F4 probe. 

Figure 3. RT-PCR of GhHDl with (3-tubulin as control. -Co: negative control, reaction 
without reverse transcriptase; -4 to 4: DP16 ovules of various stages (dpa) as indicated 
by the corresponding number; SO: 8 dpa ovule of DPI 6 after fibres being removed; 8F: 
8 dpa detached fibres of DPI 6; 0-5B: 0 dpa ovule form mutant 5B; 0-4A: 0 dpa ovule 
from mutant 4A; L: leaf; P: petal; R: root; H: hypocotyl. 

Figure 4. Histograms of relative DNA contents of ovule epidermal cells and fibre cells. 
ktt v TO THE SEQU ENCE LISTING 

SEQ ID NO:l - Partial homeodomain like protein encoded by GhHDl cDNA (clone 
ON033M7). 

SEQ ID NO:2 - Myb transcription factor like protein encoded by GhMyb25 cDNA 
(clone ON035F4). 

SEQ ID NO:3 - Partial cyclin D3 like protein encoded by GhCycD3;l cDNA (clone 
OCF07F4). 
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SEQ ID NO:4 - Partial protein encoded by GhFaEl cDNA (clone ON035N9). 
SEQ ID NO:5 - Possible partial protein encoded by GhFUl cDNA (clone ON035C9). 
SEQ ID NO:6 - Possible partial protein encoded by GhFU2 cDNA (clone ON005F1). 
SEQ ID NO:7 - a-expansin like protein cDNA (encoded in part by clone Pfsl4x). 
5 SEQ ID NO:8 - Partial protein encoded by GhTMTP cDNA (clone CHX015K18). 

SEQ ID NO:9 - Sucrose synthase gene encoded by Ghsus cDNA (clone CHX002C10). 
SEQ ID NO: 10 - Partial protein encoded by GhLTP cDNA (clone ON033M19). 
SEQ ID NO:l 1 - Protein encoded by GhLTP2 cDNA (clone OCF010D8). 
SEQ ID NO: 12 - Partial protein encoded by GhMyb25-like cDNA (clone ON038N8). 
10 SEQ ID NO: 1 3 - Protein encoded by GhRD22 cDNA (clone OCF005C10). 

SEQ ID NO: 14 - Protein encoded by GhRD22-like cDNA (clone OCF010D8). 
SEQ ID NO: 15 - Partial protein encoded by GhAsp cDNA (clone OCF008G9). 
SEQ ID NO: 16 - Partial protein encoded by cDNA clone CHX007D1. 
SEQ ID NO: 17 - GhHDl cDNA (clone ON033M7). 
15 SEQ ID NO:18 - Coding region of GhHDl cDNA (clone ON033M7). 
SEQ ID NO: 19 - GhMyb25 cDNA (clone ON035F4). 
SEQ ID NO:20 - Coding region for GhMyb25 cDNA (clone ON035F4). 
SEQ ID NO:21 - GhCycD3;l cDNA (clone OCF07F4). 
SEQ ID NO:22 - Coding region of GhCycD3;l cDNA (clone OCF07F4). 
20 SEQ ID NO:23 - GhFaEl cDNA (clone ON035N9) (entire clone coding region). 
SEQ ID NO:24 - GhFUl cDNA (clone ON035C9). 
SEQ ID NO:25 - Coding region of GhFUl cDNA (clone ON035C9). 
SEQ ID NO:26 - GhFU2 cDNA (clone ON005F1). 
SEQ ID NO:27 - Coding region of GhFU2 cDNA (clone ON005F1). 
25 SEQ ID NO:28 - a-expansin like clone Pfsl4x. 

SEQ ID NO:29 - Coding region of a-expansin like clone Pfsl4x. 
SEQ ID NO:30 - GhTMTP cDNA (clone CHX015K18). 
SEQ ID NO:31 - Coding region for GhTMTP cDNA (clone CHX015K18). 
SEQ ID NO:32 - Ghsus cDNA (clone CHX002C10). 
30 SEQ ID NO:33 - Coding region of Ghsus cDNA (clone CHX002C10). 
SEQ ID NO:34 - GhLTP cDNA (clone ON033M1 9). 
SEQ ID NO:35 - Coding region for GhLTP cDNA (clone ON033M19). 
SEQ ID NO:36 - GhLTP2 cDNA (clone OCF010D8). 
SEQ ID NO:37 - Coding region for GhLTP2 cDNA (clone OCF010D8). 
35 SEQ ID NO:38 - GhMyb25-like cDNA (clone ON038N8) (entire clone coding region). 
SEQ ID NO:39 - GURD22 cDNA (clone OCF005C10). 
SEQ ID NO:40 - Coding region for GhRD22 cDNA (clone OCF005C10). 
SEQ ID NO:41 - GhRD22-like cDNA (clone OCF010D8). 
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SEQ ID NO:42 - Coding region for GhRD22-like cDNA (clone OCF010D8). 

SEQ ID NO:43 -GhAsp cDNA (clone OCF008G9). 

SEQ ID NO:44 - Coding region for GhAsp cDNA (clone OCF008G9). 

SEQ ID NO:45 - cDNA clone CHX007D1 (entire clone coding region). 
5 SEQ ID NO:46- Cotton EST BE052193. 

SEQ ID NO:47 - Cotton EST BQ4 1 2597. 

SEQ ID NO:48 - Cotton EST AI731943. 

SEQ ID NO:49 - Cotton EST BG442467. 

SEQ ID NO:50 - Cotton EST BQ403714. 
1 0 SEQ ID NO:5 1 - Cotton EST BG443329. 

SEQ ID NO: 52 - Cotton EST BF275 1 77. 

SEQ ID NO:53 - Cotton EST BQ410140. 

SEQ ID NO:54 - Cotton EST CA993037. 

SEQ ID NO:55 - Cotton EST BG441493. 
1 5 SEQ ID NO:56 - Cotton EST BQ402375. 

SEQ ID NO:57 - Cotton EST BQ413582. 

SEQ ID NO's:58 to 64 - Stem loop sequences of dsRNA molecules. 
SEQ ID NOV.65 to 72 - Oligonucleotide primers. 

20 MTAOD PESCRgTI ON OF THE INVENTION 

General Techniques and Definitions 

Unless specifically defined otherwise, all technical and scientific terms used 
herein shall be taken to have the same meaning as commonly understood by one of 
ordinary skill in the art (e.g., in cell culture, molecular genetics, plant physiology and 
25 biochemistry, immunohistochemistry, protein chemistry, and biochemistry). 

Unless otherwise indicated, the recombinant protein, cell culture and others 
methods utilized in the present invention are standard procedures, well known to those 
skilled in the art. Such techniques are described and explained throughout the literature 
in sources such as, J. Perbal, A Practical Guide to Molecular Cloning, John Wiley and 
30 Sons (1 984), J. Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbour Laboratory Press (1989), T. A. Brown (editor), Essential Molecular Biology: A 
Practical Approach, Volumes 1 and 2, IRL Press (1991), D.M. Glover and B.D. Hames 
(editors), DNA Cloning: A Practical Approach, Volumes 1-4, IRL Press (1995 and 
1996), and F.M. Ausubel et al., (editors), Current Protocols in Molecular Biology, 
35 Greene Pub. Associates and Wiley-Interscience (1988, including all updates until 
present), Ed Harlow and David Lane (editors) Antibodies: A Laboratory Manual, Cold 
Spring Harbour Laboratory, (1988), and J.E. Coligan et al., (editors) Current Protocols 
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in Immunology, John Wiley & Sons (including all updates until present), and are 
incorporated herein by reference. 

As used herein, the term "gene" is to be taken in its broadest context and includes 
the deoxyribonucleotide sequences comprising the protein coding region of a structural 
gene and including sequences located adjacent to the coding region on both the 5' and 3' 
ends for a distance of about 1 kb on either end such that the gene corresponds to the 
length of the full-length mRNA. The sequences which are located 5' of the coding 
region and which are present on the mRNA are referred to as 5 1 non-translated 
sequences. The sequences which are located 3' or downstream of the coding region and 
which are present on the mRNA are referred to as 3' non-translated sequences; these 
sequences. The term "gene" encompasses both cDNA and genomic forms of a gene. A 
genomic form or clone of a gene contains the coding region which may be interrupted 
with non-coding sequences termed "introns" or "intervening regions" or "intervening 
sequences." Introns are segments of a gene which are transcribed into nuclear RNA 
15 (hnRNA); introns may contain regulatory elements such as enhancers. Introns are 
removed or "spliced out" from the nuclear or primary transcript; introns therefore are 
absent in the messenger RNA (mRNA) transcript. The mRNA functions during 
translation to specify the sequence or order of amino acids in a nascent polypeptide. 
The term "gene" includes a synthetic or fusion molecule encoding all or part of the 
20 proteins of the invention described herein and a complementary nucleotide sequence to 

any one of the above. 

A "polymorphism" as used herein denotes a variation in the nucleotide sequence 
between genes of the invention, of different species, cultivars, strains or individuals of a 
plant. A "polymorphic position" is a preselected nucleotide position within the 
25 sequence of the gene. In some cases, genetic polymorphisms are reflected by an amino 
acid sequence variation, and thus a polymorphic position can result in location of a 
polymorphism in the amino acid sequence at a predetermined position in the sequence 
of a polypeptide. Typical polymorphisms are deletions, insertions or substitutions. 
These can involve a single nucleotide (single nucleotide polymorphism or SNP) or two 

30 or more nucleotides. 

A "deletion," as used herein, refers to a change in either amino acid or nucleotide 
sequence in which one or more amino acid or nucleotide residues, respectively, are 
absent. 

An "insertion" or "addition," as used herein, refers to a change in an amino acid 
35 or nucleotide sequence resulting in the addition of one or more amino acid or nucleotide 

residues, respectively. 

A "substitution," as used herein, refers to the replacement of one or more amino 
acids or nucleotides by different amino acids or nucleotides, respectively. 
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By "linked" or "genetically linked" it is meant that a marker locus and a second 
locus are sufficiently close on a chromosome that they will be inherited together in more 
than 50% of meioses, e.g., not randomly. Thus, the percent of recombination observed 
between the loci per generation (centimorgans (cM)), will be less than 50. In particular 
embodiments of the invention, genetically linked loci may be 45, 35, 25, 15, 10, 5, 4, 3, 
2, or 1 or less cM apart on a chromosome. Preferably, the markers are less than 5 cM 
apart and most preferably about 0 cM apart. 

The term "fibre" refers to plant cell types that share in common the features of 
having an elongated shape and abundant cellulose in thick cell walls, usually, but not 
always, described as secondary walls. Such walls may or may not be lignified, and the 
protoplast of such cells may or may not remain alive at maturity. Here the term "fibre" 
is used in its most inclusive sense, for example including: (a) thick-walled conducting 
and non-conducting cells of the xylem; (b) fibres of extraxylary origin, including those 
from ovary including the outer integument of the ovary, phloem, bark, ground tissue, 
and epidermis; and (c) fibres from stems, leaves, roots, seeds, and flowers or 
inflorescences. Such fibres have many industrial .uses, for example in textiles, paper, 
sacking and boxing material, cordage, brushes and brooms, filling and stuffing, 
caulking, reinforcement of other materials, and manufacture of cellulose derivatives. In 
some industries, the term "fibre" is usually inclusive of thick-walled conducting cells 
20 such as vessels and tracheids and to fibrillar aggregates of many individual fibre cells. 
In a preferred embodiment, cotton fibre refers to the lint produced from the cotton boll 
(seed capsule) that is produced commercially. Cotton fibre also includes the short (about 
several mm) fibres sometimes referred to as "fuzz fibres". Preferred fibre producmg 
plants include, but are not limited to, cotton (such as Gossypium arboreum, Gossypium 
25 herbaceum, Gossypium barbadense and Gossypium hirsutuni), silk cotton tree (Kapok, 
Ceiba pentandra), desert willow, creosote bush, winterfal, balsa, ramie, kenaf, hemp 
(Cannabis sativa), roselle, jute, sisal abaca, flax, and horticultural plants such as.grape, 

peach, pear, and apple. 

As used herein, a "wild-type plant" is a plant that has not been altered by a 
30 - method of the invention and/or does not comprise a transgene of the invention. When 
performing a method of the invention for altering fibre initiation and/or elongation, the 
manipulated plant is compared to a non-manipulated ("wild-type") member of the same 
species to determine the impact of the manipulation on fibre initiation and/or elongation. 
In one embodiment, the term "altering fibre initiation and/or elongation" refers to 
35 increasing the number and/or size of the fibres. In another embodiment, the term 
"altering fibre initiation and/or elongation" refers to decreasing the number and/or size 
of the fibres. In a further embodiment, the term "altering fibre initiation and/or 
elongation" refers to modifying the timing of fibre initiation and/or elongation during 
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development of the plant, for example to promote earlier or delayed initiation, or to 
regulate the synchrony of fibre initiation. 

Detection of impaired genes and/ or gene expression levels 

Any molecular biological technique known in the art which is capable of 
detecting a polymorphism/mutation/genetic variation or differential gene expression can 
be used in the methods of the present invention. Such methods include, but are not 
limited to, the use of nucleic acid amplification, nucleic acid sequencing, nucleic acid 
hybridization with suitably labeled probes, single-strand conformational analysis 
(SSCA), denaturing gradient gel electrophoresis (DGGE), heteroduplex analysis (HET), 
chemical cleavage analysis (CCM), catalytic nucleic acid cleavage, or a combination 
thereof (see, for example, Lemieux, 2000). The invention also includes the use of 
molecular marker techniques to detect polymorphisms closely linked to genes of the 
invention. Such methods include the detection or analysis of restriction fragment length 
polymorphisms (RFLP), RAPD, amplified fragment length polymorphisms (AFLP) and 
microsatellite (simple sequence repeat, SSR) polymorphisms. The closely linked 
markers can be obtained readily by methods well known in the art, such as Bulked 
Segregant Analysis. 

The "polymerase chain reaction" ("PCR") is a reaction in which replicate copies 
are made of a target polynucleotide using a "pair of primers" or "set of primers" 
consisting of "upstream" and a "downstream" primer, and a catalyst of polymerization, 
such as a DNA polymerase, and typically a thermally-stable polymerase enzyme. 
Methods for PCR are known in the art, and are taught, for example, in "PCR" (Ed. M.J. 
McPherson and S.G Moller (2000) BIOS Scientific Publishers Ltd, Oxford). PCR can 
be performed on cDNA obtained from reverse transcribing mRNA isolated from plant 
cells expressing, or that should be expressing, a gene of the invention. However, it will 
generally be easier if PCR is performed on genomic DNA isolated from a plant. 

A primer is an oligonucleotide, usually of about 20 nucleotides long, with a 
minimum of about 15 and a maximum of about 50 nucleotides, that is capable of 
hybridising in a sequence specific fashion to the target sequence and being extended 
during the PCR. Amplicons or PCR products or PCR fragments or amplification 
products are extension products that comprise the primer and the newly synthesized 
copies of the target sequences. Multiplex PCR systems contain multiple sets of primers 
that result in simultaneous production of more than one amplicon.. Primers may be 
perfectly matched to the target sequence or they may contain internal mismatched bases 
that can result in the induction of restriction enzyme or catalytic nucleic acid 
recognition/cleavage sites in specific target sequences. Primers may also contain 
additional sequences and/or modified or labelled nucleotides to facilitate capture or 
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detection of amplicons. Repeated cycles of heat denaturation of the DNA, annealing of 
primers to their complementary sequences and extension of the annealed primers with 
polymerase result in exponential amplification of the target sequence. The terms target 
or target sequence or template refer to nucleic acid sequences which are amplified. 

Methods for direct sequencing of nucleotide sequences are well known to those 
skilled in the art and can be found for example in Ausubel et al., eds., Short Protocols in 
Molecular Biology, 3rd ed., Wiley, (1995) and Sambrook et al., Molecular Cloning, 2nd 
ed., Chap. 13, Cold Spring Harbor Laboratory Press, (1989). Sequencing can be carried 
out by any suitable method, for example, dideoxy sequencing, chemical sequencing or 
variations thereof. Direct sequencing has the advantage of determining variation in any 
base pair of a particular sequence. 

Hybridization based detection systems include, but are not limited to, the 
TaqMan assay and molecular beacons. The TaqMan assay (US 5,962,233) uses allele 
specific (ASO) probes with a donor dye on one end and an acceptor dye on the other 
end such that the dye pair interact via fluorescence resonance energy transfer (FRET). 
A target sequence is amplified by PCR modified to include the addition of the labeled 
ASO probe. The PCR conditions are adjusted so that a single nucleotide difference will 
effect binding of the probe. Due to the 5' nuclease activity of the Taq polymerase 
enzyme, a perfectly complementary probe is cleaved during PCR while a probe with a 
single mismatched base is not cleaved. Cleavage of the probe dissociates the donor dye 
from the quenching acceptor dye, greatly increasing the donor fluorescence. 

An alternative to the TaqMan assay is the molecular beacon assay (US 
5,925,517). In the molecular beacon assay, the ASO probes contain complementary 
sequences flanking the target specific species so that a hairpin structure is formed. The 
loop of the hairpin is complimentary to the target sequence while each arm of the 
hairpin contains either donor or acceptor dyes. When not hybridized to a . donor 
sequence, the hairpin structure brings the donor and acceptor dye close together thereby 
extinguishing the donor fluorescence. When hybridized to the specific target sequence, 
however, the donor and acceptor dyes are separated with an increase in fluorescence of 
up to 900 fold. Molecular beacons can be used in conjunction with amplification of the 
target sequence by PCR and provide a method for real time detection of the presence of 
target sequences or can be used after amplification. 

Marker assisted selection is a well recognised method of selecting for 
heterozygous plants required when backcrossing with a recurrent parent in a classical 
breeding program. The population of plants in each backcross generation will be 
heterozygous for the gene of interest, normally present in a 1:1 ratio in a backcross 
population, and the molecular marker can be used to distinguish the two alleles. By 
extracting DNA from, for example, young leaves or shoots and testing with a specific 
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marker for the introgressed desirable trait, early selection of plants for further 
backcrossing is made whilst energy and resources are concentrated on fewer plants. 

Polypeptides 

By "substantially purified polypeptide" we mean a polypeptide that has been at 
least partially separated from the lipids, nucleic acids, other polypeptides, and other 
contaminating molecules with which it is associated in its native state. Preferably, the 
substantially purified polypeptide is at least 60% free, preferably at least 75% free, and 
most preferably at least 90% free from other components with which they are naturally 
associated. Furthermore, the term "polypeptide" is used interchangeably herein with the 
term "protein". 

The % identity of a polypeptide is determined by GAP (Needleman and Wunsch, 
1970) analysis (GCG program) with a gap creation penalty=5, and a gap extension 
penalty=0.3. Unless stated otherwise, the query sequence is at least 15 amino acids in 
length, and the GAP analysis aligns the two sequences over a region of at least 15 amino 
acids. More preferably, the query sequence is at least 50 amino acids in length, and the 
GAP analysis aligns the two sequences over a region of at least 50 amino acids. Even 
more preferably, the query sequence is at least 100 amino acids in length and the GAP 
analysis aligns the two sequences over a region of at least 100 amino acids. 

Amino acid sequence mutants of the polypeptides of the present invention can be 
prepared by introducing appropriate nucleotide changes into a nucleic acid of the 
present invention, or by in vitro synthesis of the desired polypeptide. Such mutants 
include, for example, deletions, insertions or substitutions of residues within the amino 
acid sequence. A combination of deletion, insertion and substitution can be made to 
arrive at the final construct, provided that the final protein product possesses the desired 
characteristics. 

In designing amino acid sequence mutants, the location of the mutation site and 
the nature of the mutation will depend on characteristic(s) to be modified. The sites for 
mutation can be modified individually or in series, e.g., by (1) substituting first with 
conservative amino acid choices and then with more radical selections depending upon 
the results achieved, (2) deleting the target residue, or (3) inserting other residues 
adjacent to the located site. 

Amino acid sequence deletions generally range from about 1 to 30 residues, 
more preferably about 1 to 10 residues and typically about 1 to 5 contiguous residues. 

Substitution mutants have at least one amino acid residue in the polypeptide 
molecule removed and a different residue inserted in its place. The sites of greatest 
interest for substitutional mutagenesis include sites identified as the active or binding 
site(s). Other sites of interest are those in which particular residues obtained from 
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various strains or species are identical. These positions may be important for biological 
activity These sites, especially those falling within a sequence of at least three other 
identically conserved sites, are preferably substituted in a relatively conservative 
manner. Such conservative substitutions are shown in Table 1 . 



TABLE 1 . Exemplary substitutions. 
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Lys (K) 


arg 


Met (M) 


leu; phe 


Phe (F) 


leu; val; ala 


Pro (P) 


siy ■ 


Ser (S) 


thr 


Thr(T) 


ser 


Trp (W) . 


-Jze : 


Tvr(Y) 


trp; phe 


Val (V) 


ile; leu; met; phe, ala 



Furthermore, if desired, unnatural amino acids or chemical amino acid analogues 
10 can be introduced as a substitution or addition into the polypeptides of the present 
invention. Such amino acids include, but are not limited to, the D-iSomers of the 
common amino acids, 2,4-diaminobutyric acid, a-amino isobutyric acid, 4-ammobutync 
acid, 2-aminobutyric acid, 6-amino hexanoic acid, 2-amino isobutync acid, 3-amino 
propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrullme, 
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homocitrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, 
cyclohexylalanine, (3-alanine, fluoro-amino acids, designer amino acids such as p- 
methyl amino acids, Cd-methyl amino acids, Na-methyl amino acids, and amino acid 
analogues in general. 

5 Also included within the scope of the invention are polypeptides of the present 

invention which are differentially modified during or after synthesis, e.g., by 
biotinylation, ben2ylation, glycosylation, acetylation, phosphorylation, amidation, 
derealization by known protecting/blocking groups, proteolytic cleavage, linkage to an 
antibody molecule or other cellular ligand, etc. ' These modifications may serve to 
1 0 increase the stability and/or bioactivity of the polypeptide of the invention. 

Polypeptides of the present invention can be produced in a variety of ways, 
including production and recovery of natural proteins, production and recovery of 
recombinant proteins, and chemical synthesis of the proteins. In one embodiment, an 
isolated polypeptide of the present invention is produced by culturing a cell capable of 
15 expressing the polypeptide under conditions effective to produce the polypeptide, and 
recovering the polypeptide. A preferred cell to culture is a recombinant cell of the 
present invention. Effective culture conditions include, but are not limited to, effective 
media, bioreactor, temperature, pH and oxygen conditions that permit protein 
production. An effective medium refers to any medium in which a cell is cultured to 
20 produce a polypeptide of the present invention. Such medium typically comprises an 
aqueous medium having assimilable carbon, nitrogen and phosphate sources, and 
appropriate salts, minerals, metals and other nutrients, such as vitamins. Cells of the 
present invention can be cultured in conventional fermentation bioreactors, shake flasks, 
test tubes, microliter dishes, and petri plates. Culturing can be carried out at a 
25 temperature, pH and oxygen content appropriate for a recombinant cell. Such culturing 
conditions are within the expertise of one of ordinary skill in the art. 

In a preferred embodiment, the polypeptides of the invention are transcription 
factors, regulatory proteins, or proteins that regulate the cell-cycle in the fibre producing 
plant. The transcription factors may be Myb transcription factors or homeodomain 
30 containing transcription factors, which are classes well known in the art. 

Polynucleotides 

By an "isolated polynucleotide", including DNA, RNA, or a combination of 
these, single or double stranded, in the sense or antisense orientation or a combination of 
35 both, dsRNA or otherwise, we mean a polynucleotide which is at least partially 
separated from the polynucleotide sequences with which it is associated or linked in its 
native state. Preferably, the isolated polynucleotide is at least 60% free, preferably at 
least 75% free, and most preferably at least 90% free from other components with which 



Ho™ ZZZZaZZ hv IISPTO from the IFW Image Database on 04/19/2005 



26 



they are naturally associated. Furthermore, the term "polynucleotide" is used 
interchangeably herein with the term "nucleic acid molecule". 

The % identity of a polynucleotide is determined by GAP (Needleman and 
Wunsch, 1970) analysis (GCG program) with a gap creation penalty=5, and a gap 
5 extension penalty=0.3. Unless stated otherwise, the query sequence is at least 45 
nucleotides in length, and the GAP analysis aligns the two sequences over a region of at 
least 45 nucleotides. Preferably, the query sequence is at least 150 nucleotides in 
length, and the GAP analysis aligns the two sequences over a region of at least 150 
nucleotides. More preferably, the query sequence is at least 300 nucleotides in length 
1 0 and the GAP analysis aligns the two sequences over a region of at least 300 nucleotides. 

Oligonucleotides of the present invention can be RNA, DNA, or derivatives of 
either. The minimum size of such oligonucleotides is the size required for the formation 
of a stable hybrid between an oligonucleotide and a complementary sequence on a 
nucleic acid molecule of the present invention. Preferably, the oligonucleotides are at 
15 least 15 nucleotides, more preferably at least 18 nucleotides, more preferably at least 19 
nucleotides, more preferably at least 20 nucleotides, even more preferably at least 25 
nucleotides in length. The present invention includes oligonucleotides that can be used 
as, for example, probes to identify nucleic acid molecules, primers to produce nucleic 
acid molecules or as agents to modify fibre initiation and/or elongation (e.g., as 
20 antisense-, triplex formation-, ribozyme- and/or RNA drug-based reagents). 
Oligonucleotide of the present invention used as a probe are typically conjugated with a 
label such as a radioisotope, an enzyme, biotin, a fluorescent molecule or a 
chemiluminescent molecule. 

Polynucleotides and oligonucleotides of the present invention include those 
25 which hybridize under stringent conditions to a sequence provided as SEQ ID NO's: 17 
to 45- As used herein, stringent conditions are those that (1) employ low ionic strength 
and high temperature for washing, for example, 0.015 M NaCl/0.0015 M sodium 
citrate/0.1% NaDodS0 4 at 50°C; (2) employ during hybridisation a denaturing agent 
such as formamide, for example, 50% (vol/vol) formamide with 0.1% bovine serum 
30 albumin, 0.1% Ficoll, 0.1% polyvinylpyrrolidone, 50 mM sodium phosphate buffer at 
pH 6.5 with 750 mM NaCl, 75 mM sodium citrate at 42°C; or (3) employ 50% 
formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate 
(pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm 
DNA (50 g/ml), 0.1% SDS and 10% dextran sulfate at 42°C in 0.2 x SSC and 0.1% 
35 SDS. 

Polynucleotides of the present invention may possess, when compared to 
naturally occurring molecules, one or more mutations which are deletions, insertions, or 
substitutions of nucleotide residues. Mutants can be either naturally occurring (that is to 



Coov otreM'W Ku MSPTO from the IFW Image Database on 04/19/2005 



27 



say, isolated from a natural source) or synthetic (for example, by performing site- 
directed mutagenesis on the nucleic acid). 

Antisense Polynucleotides 
5 ~ The term "antisense nucleic acid" shall be taken to mean DNA or RNA, or 

combination thereof, molecule that is complementary to at least a portion of a specific 
mRNA molecule of the invention and capable of interfering with a post-transcriptional 
event such as mRNA translation. . The use of antisense methods is well known in the 
art (see for example, G. Hartmann and S. Endres, Manual of Antisense Methodology, 
1 0 Kluwer ( 1 999)). The use of antisense techniques in plants has been reviewed by 

Bourque (1995) and Senior (1998). Bourque lists a large number of examples of how 
antisense sequences have been utilized in plant systems as a method of gene 
inactivation. She also states that attaining 100% inhibition of any enzyme activity may 
not be necessary as partial inhibition will more than likely result in measurable change 
1 5 in the system. Senior (1998) states that antisense methods are now a very well 
established technique for manipulating gene expression. 

Antisense molecules may include sequences that correspond to the structural 
genes or for sequences that effect control over the gene expression or splicing event. 
For example, the antisense sequence may correspond to the targeted coding region of 
20 the genes of the invention, or the 5 '-untranslated region (UTR) or the 3 '-UTR or 

combination of these. It may be complementary in part to intron sequences, which may 
be spliced out during or after transcription, preferably only to exon sequences of the 
target gene. In view of the generally greater divergence of the UTRs, targeting these 
regions provides greater specificity of gene inhibition. The length of the antisense 
25 sequence should be at least 19 contiguous nucleotides, preferably at least 50 nucleotides, 
and more preferably at least 100, 200, 500 or 1 000 nucleotides. The full-length sequence 
complementary to the entire gene transcript may be used. The length is most preferably 
100-2000 nucleotides. The degree of homology of the antisense sequence to the 
targeted transcript should be at least 85%, preferably at least 90% and more preferably 
30 95-100%. The antisense RNA molecule may of course comprise unrelated sequences 
which may function to stabilize the molecule. 

Catalytic Polynucleotides 

The term catalytic polynucleotide/nucleic acid refers to a DNA molecule or 
35 DNA-containing molecule (also known in the art as a "deoxyribozyme") or an RNA or 
RNA-containing molecule (also known as a "ribozyme") which specifically recognizes a 
distinct substrate and catalyzes the chemical modification of this substrate. The nucleic 
acid bases in the catalytic nucleic acid can be bases A, C, G, T (and U for RNA). 
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Typically, the catalytic nucleic acid contains an antisense sequence for specific 
recognition of a target nucleic acid, and a nucleic acid cleaving enzymatic activity (also 
referred to herein as the "catalytic domain"). The types of ribozymes that are 
particularly useful in this invention are the hammerhead ribozyme (Haseloff and 
Gerlach, 1988, Perriman et al., 1992) and the hairpin ribozyme (Shippy et al., 1999). 

The ribozymes of this invention and DNA encoding the ribozymes can be 
chemically synthesized using methods well known in the art. The ribozymes can also 
be prepared from a DNA molecule (that upon transcription, yields an RNA molecule) 
operably linked to an RNA polymerase promoter, e.g., the promoter for T7 RNA 
polymerase or SP6 RNA polymerase. Accordingly, also provided by this invention is a 
nucleic acid molecule, i.e., DNA or cDNA, coding for the ribozymes of this invention. 
When the vector also contains an RNA polymerase promoter operably linked to the 
DNA molecule, the ribozyme can be produced in vitro upon incubation with RNA 
polymerase and nucleotides. In a separate embodiment, the DNA can be inserted into 
an expression cassette or transcription cassette. After synthesis, the RNA molecule can 
be modified by ligation to a DNA molecule having the ability to stabilize the ribozyme 
and make it resistant to RNase. ' 

RNA interference 

RNA interference (RNAi) is particularly useful for specifically inhibiting the 
production of a particular protein. Although not wishing to be limited by theory, 
Waterhouse et al. (1998) have provided a model for the mechanism by which dsRNA 
can be used to reduce protein production. . This technology relies on the presence of 
dsRNA molecules that contain a sequence that is essentially identical to the mRNA of 
the gene of interest or part thereof, in this case an mRNA encoding a polypeptide 
according to the invention. Conveniently, the dsRNA can be produced from a single 
promoter in a recombinant vector or host cell, where the sense and anti-sense sequences 
are flanked by an unrelated sequence which enables the sense and anti-sense sequences 
to hybridize to form the dsRNA molecule with the unrelated sequence forming a loop 
structure. The design and production of suitable dsRNA molecules for the present 
invention is well within the capacity of a person skilled in the art, particularly 
considering Waterhouse et al. (1998), Smith et al. (2000), WO 99/32619, WO 99/53050, 
WO 99/49029, and WO 01/34815. In this method a DNA is introduced that directs the 
synthesis of an at least partly double stranded RNA produces) with homology to the 
target gene to be inactivated. The DNA therefore comprises both sense and antisense 
sequences that, when transcribed into RNA, can hybridize to form the double-stranded 
RNA region. In a preferred embodiment, the sense and antisense sequences are 
separated by a spacer region that comprises an intron which, when transcribed into 
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RNA, is spliced out. This arrangement has been shown to result in a higher efficiency 
of gene silencing. The double-stranded region may comprise one or two RNA 
molecules, transcribed from either one DNA region or two. The presence of the double 
stranded molecule is thought to trigger a response from an endogenous plant system that 
destroys both the double stranded RNA and also the homologous RNA transcript from 
the target plant gene, efficiently reducing or eliminating the activity of the target gene. 
The length of the sense and antisense sequences that hybridise should each be at least 19 
contiguous nucleotides, preferably at least 30 or 50 nucleotides, and more preferably at 
least 100, 200, 500 or 1000 nucleotides. The full-length sequence corresponding to the 
entire gene transcript may be used. The lengths are most preferably 100-2000 
nucleotides. The degree of homology of the sense and antisense sequences to the 
targeted transcript should be at least 85%, preferably at least 90% and more preferably 
95-100%. The RNA molecule may of course comprise unrelated sequences which may 
function to stabilize the molecule. The RNA molecule may be expressed under the 
control of a RNA polymerase II or RNA polymerase III promoter. Examples of the 
latter include tRNA or snRNA promoters. 

Preferred small interfering RNA ('siRNA") molecules comprise a nucleotide 
sequence that is identical to about 19-21 contiguous nucleotides of the target mRNA. 
Preferably, the target mRNA sequence commences with the dinucleotide AA, comprises 
a GC-content of about 30-70% (preferably, 30-60%, more preferably 40-60% and more 
preferably about 45%-55%), and does not have a high percentage identity to any 
nucleotide sequence other than the target in the genome of the plant (preferably cotton) 
in which it is to be introduced, e.g., as determined by standard BLAST search. 

Preferred loop ("single stranded") sequences are selected from, but not limited to, 
the group consisting of: 

(i) CCC(SEQIDNO:58); 
(if) UUCG (SEQ ID NO: 59); 

(iii) CCACC (SEQ ID NO:60); 

(iv) CUCGAG (SEQ ID NO:61); 

(v) AAGCUU(SEQIDNO:62); 

(vi) CCACACC (SEQ ID NO:63); and 

(vii) UUCAAGAGA (SEQ ID NO:64). 

Another molecular biological approach that may be used is co-suppression. The 
mechanism of co-suppression is not well understood but is thought to involve post- 
transcriptional gene silencing (PTGS) and in that regard may be very similar to many 
examples of antisense suppression. It involves introducing an extra copy of a gene or a 
fragment thereof into a plant in the sense orientation with respect to a promoter for its 
expression. The size of the sense fragment, its correspondence to target gene regions, 
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and its degree of homology to the target gene are as for the antisense sequences 
described above. In some instances the additional copy of the gene sequence interferes 
with the expression of the target plant gene. Reference is made to Patent specification 
WO 97/20936 and European patent specification 0465572 for methods of implementing 
co-suppression approaches. The antisense, cosuppression or double stranded RNA 
molecules may also comprise a largely double-stranded RNA region, preferably 
comprising a nuclear localization signal, as described in PCT/AU03/00292. In a 
preferred embodiment, the largely double-stranded region is derived from a PSTVd type 
viroid or comprises at least 35 CUG trinucleotide repeats. 

Transgenic Plants 

The term "plant" refers to whole plants, plant organs (e.g. leaves, stems roots, 
etc), seeds, plant cells and the like. Plants contemplated for use in the practice of the 
present invention include both monocotyledons and dicotyledons. Preferably, the plant 
is a horticultural plant or cotton. 

The term "cotton" as used herein includes any species of the genus Gossypium 
which is used for commercial fibre production, preferably G. hirsutum or G. 
barbadense. 

Transgenic plants, as defined in the context of the present invention include 
plants (as well as parts and cells of said plants) and their progeny which have been 
genetically modified using recombinant techniques. This would generally be to either i) 
cause or enhance production of at least one protein of the present invention in the 
desired plant or plant organ, or ii) disrupt the production and/or activity of a polypeptide 
of the present invention. Transformed plants contain genetic material that they did not 
contain prior to the transformation. The genetic material is preferably stably integrated 
into the genome of the plant. The introduced genetic material may comprise sequences 
that naturally occur in the same species but in a rearranged order or in a different 
arrangement of elements, for example an antisense sequence. Such plants are included 
herein in "transgenic plants". A "non-transgenic plant" is one which has not been 
genetically modified with the introduction of genetic material by recombinant DNA 
techniques. 

Several techniques exist for introducing foreign genetic material into a plant cell. 
Such techniques include acceleration of genetic material coated onto microparticles 
directly into cells (see, for example, US 4,945,050 and US 5,141,131). Plants may be 
transformed using Agrobacterium technology (see, for example, US 5,177,010, US 
5,104,310, US 5,004,863, US 5,159,135). Electroporation technology has also been 
used to transform plants (see, for example, WO 87/06614, US 5,472,869; 5,384,253, 
WO 92/09696 and WO 93/21335). In addition to numerous technologies for 
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transforming plants, the type of tissue which is contacted with the foreign genes may 
vary as well. Such tissue would include but would not be limited to embryogenic tissue, 
callus tissue type I and II, hypocotyl, meristem, and the like. Almost all plant tissues 
may be transformed during development and/or differentiation using appropriate 
techniques described herein. 

A particularly preferred method of producing a transgenic cotton plant is by 
Agrobacterium-mediated transformation of cotyledons, followed by the induction of 
callus formation, and the subsequent induction of embryogenic callus, and regeneration 
into plants. 

A number of vectors suitable for stable transfection of plant cells or for the 
establishment of transgenic plants have been described in, e.g., Pouwels et al., Cloning 
Vectors: A Laboratory Manual, 1985, supp. 1987; Weissbach and Weissbach, Methods 
for Plant Molecular Biology, Academic Press, 1989; and Gelvin et aL, Plant Molecular 
Biology Manual, Kluwer Academic Publishers, 1990. Typically, plant expression 
15 vectors include, for example, one or more cloned plant genes under the transcriptional 
control of 5' and 3 f regulatory sequences and a dominant selectable marker. Such plant 
expression vectors also can contain a promoter regulatory region (e.g., a regulatory 
region controlling inducible or constitutive, environmentally- or developmentally- 
regulated, or cell- or tissue-specific expression), a transcription initiation start site, a 
20 ribosome binding site, an RNA processing signal, a transcription termination site, and/or 
a polyadenylation signal. 

Examples of plant promoters include, but are not limited to ribulose-1,6- 
bisphosphate carboxylase small subunit, beta-conglycinin promoter, phaseolin promoter, 
high molecular weight glutenin (HMW-GS) promoters, starch biosynthetic gene 
25 promoters, ADH promoter, heat-shock promoters and tissue specific promoters. 
Promoters may also contain certain enhancer sequence elements that may improve the 
transcription efficiency. Typical enhancers include but are not limited to Adh-intron 1 
and Adh-intron 6. 

Constitutive promoters direct continuous gene expression in all cells types and at 
30 all times (e.g., actin, ubiquitin, CaMV 35S). Tissue specific promoters are responsible 
for gene expression in specific cell or tissue types, such as the leaves or seeds (e.g., zein, 
oleosin, napin, ACP, globulin and the like) and these prompters may also be used. 
Promoters may also be active during a certain stage of the plants' development as well 
as active in plant tissues and organs. Examples of such promoters include but are not 
35 limited to pollen-specific, embryo specific, corn silk specific, cotton fibre specific, root 
specific, seed endosperm specific promoters and the like. 

In a particularly preferred embodiment, the promoter directs expression around 
anthesis which is when fibre initiation and elongation occur. Thus, it is preferred that 
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the promoter is an ovule or fibre specific promoter. Examples include promoters 
described in U.S. 5,495,070, U.S. 5,608,148 and U.S. 5,602,321. 

Under certain circumstances it may be desirable to use an inducible promoter. 
An inducible promoter is responsible for expression of genes in response to a specific 
signal, such as: physical stimulus (heat shock genes); light (RUBP carboxylase); 
hormone (Em); metabolites; and stress. Other desirable transcription and translation 
elements that function in plants may be used. 

In addition to plant promoters, promoters from a variety of sources can be used 
efficiently in plant cells to express foreign genes. For example, promoters of bacterial 
origin, such as the octopine synthase promoter, the nopaline synthase promoter, the 
mannopine synthase promoter; promoters of viral origin, such as the cauliflower mosaic 
virus (35S and 19S) and the like may be used. 

Vectors 

One embodiment of the present invention includes a recombinant vector, which 
includes at least one isolated polynucleotide molecule of the present invention, inserted 
into any vector capable of delivering the nucleic acid molecule into a host cell. Such a 
vector contains heterologous nucleic acid sequences, that is nucleic acid sequences that 
are not naturally. found adjacent to nucleic acid molecules of the present invention and 
that preferably are derived from a species other than the species from which the nucleic 
acid molecule(s) are derived. The vector can be either RNA or DNA, either prokaryotic 
or eukaryotic, and typically is a virus or a plasmid. 

One type of recombinant vector comprises a nucleic acid molecule of the present 
invention operatively linked to an expression vector. The phrase operatively linked 
refers to insertion of a nucleic acid molecule into an expression vector in a manner such 
that the molecule is able to be expressed when transformed into a host cell. As used 
herein, an expression vector is a DNA or RNA vector that is capable of transforming a 
host cell and effecting expression of a specified nucleic acid molecule. Preferably, the 
expression vector is also capable of replicating within the host cell. Expression vectors 
can be either prokaryotic or eukaryotic, and are typically viruses or plasmids. 
Expression vectors of the present invention include any vectors that function (i.e., direct 
gene expression) in recombinant cells of the present invention, including in bacterial, 
fungal, endoparasite, arthropod, other animal, and plant cells. Preferred expression 
vectors of the present invention can direct gene expression in plant cells. 

In particular, expression vectors of the present invention contain regulatory 
sequences such as transcription control sequences, translation control sequences, origins 
of replication, and other regulatory sequences that are compatible with the recombinant 
cell and that control the expression of nucleic acid molecules of the present invention. 
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In particular, recombinant molecules of the present invention include transcription 
control sequences. Transcription control sequences are sequences which control the 
initiation, elongation, and termination of transcription. Particularly important 
transcription control sequences are those which control transcription initiation, such as 
promoter, enhancer, operator and repressor sequences. Suitable transcription control 
sequences include any transcription control sequence that can function in at least one of 
the recombinant cells of the present invention. A variety of such transcription control 
sequences are known to those skilled in the art. 

Another embodiment of the present invention includes a recombinant cell 
comprising a host cell transformed with one or more recombinant molecules of the 
present invention. Transformation of a nucleic acid molecule into a cell can be 
accomplished by any method by which a nucleic acid molecule can be inserted into the 
cell. Transformation techniques include, but are not limited to, transfection, 
electroporation, microinjection, lipofection, adsorption, and protoplast fusion. A 
15 recombinant cell may remain unicellular or may grow into a tissue, organ or a 
multicellular organism. Transformed nucleic acid molecules of the present invention 
can remain extrachromosomal or can integrate into one or more sites within a 
chromosome of the transformed (i.e., recombinant) cell in such a manner that their 
ability to be expressed is retained. 



10 
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Host cells 

Suitable host cells to transform include any cell that can be transformed with a 
polynucleotide of the present invention. Host cells can be either untransformed cells or 
cells that are already transformed with at least one nucleic acid molecule (e.g., nucleic 

25 acid molecules encoding one or more proteins of the present invention). Host cells of 
the present invention either can be endogenously (i.e., naturally) capable of producing 
proteins of the present invention or can be capable of producing such proteins after 
being transformed with at least one nucleic acid molecule of the present invention. Host 
cells of the present invention can be any cell capable of producing at least one protein of 

30 the present invention, and include bacterial, fungal (including yeast), parasite, 
arthropod, animal and plant cells. Preferred host cells are plant cells, in particular 
cotton cells. In a preferred embodiment, the ceils are ovule cells such as the cells of the 
outer integument of cotton ovules. 



35 Antibodies 

The invention also provides monoclonal or polyclonal antibodies to polypeptides 
of the invention or fragments thereof. Thus, the present invention further provides a 
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process for the production of monoclonal or polyclonal antibodies to polypeptides of the 
invention. 

The term "binds specifically" refers to the ability of the antibody to bind to 
proteins of the present invention but not other proteins obtained of the plant. 

5 As used herein, the term "epitope" refers to a region of a protein of the invention 

which is bound by the antibody. An epitope can be administered to an animal to 
generate antibodies against the epitope, however, antibodies of the present invention 
preferably specifically bind the epitope region in the context of the entire protein. 

If polyclonal antibodies are desired, a selected mammal (e.g., mouse, rabbit, 

10 goat, horse, etc.) is immunised with an immunogenic polypeptide. Serum from the 
immunised animal is collected and treated according to known procedures. If serum 
containing polyclonal antibodies contains antibodies to other antigens, the polyclonal 
antibodies can be purified by immunoaffinity chromatography. Techniques for 
producing and processing polyclonal antisera are known in the art. In order that such 

15 antibodies may be made, the invention also provides polypeptides of the invention or 
fragments thereof haptenised to another polypeptide for use as immunogens in animals 
or humans,. 

Monoclonal antibodies directed against polypeptides of the invention can also be 
readily produced by one skilled in the art. The general methodology for making 

20 monoclonal antibodies by hybridomas is well known. Immortal antibody-producing cell 
lines can be created by cell fusion, and also by other techniques such as direct 
transformation of B lymphocytes with oncogenic DNA, or transfection with Epstein- 
Barr virus. Panels of monoclonal antibodies produced can be screened for various 
properties; i.e., for isotype and epitope affinity. 

25 An alternative technique involves screening phage display libraries where, for 

example the phage express scFv fragments on the surface of their coat with a large 
variety of complementarity determining regions (CDRs). This technique is well known 
in the art. 

For the purposes of this invention, the term "antibody", unless specified to the 
30 contrary, includes fragments of whole antibodies which retain their binding activity for 
a target antigen. Such fragments include Fv, F(ab') and F(ab*) 2 fragments, as well as 
single chain antibodies (scFv). Furthermore, the antibodies and fragments thereof may 
be humanised antibodies, for example as described in EP-A-239400. 

Antibodies of the invention may be bound to a solid support and/or packaged 
35 into kits in a suitable container along with suitable reagents, controls, instructions and 
the like. 

Preferably, antibodies of the present invention are detectably labeled. Exemplary 
detectable labels that allow for direct measurement of antibody binding include 
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radiolabels, fiuorophores, dyes, magnetic beads, chemiluminescers, colloidal particles, 
and the like. Examples of labels which permit indirect measurement of binding include 
enzymes where the substrate may provide for a coloured or fluorescent product. 
Additional exemplary detectable labels include covalently bound enzymes capable of 

5 providing a detectable product signal after addition of suitable substrate. Examples of 
suitable enzymes for use in conjugates include horseradish peroxidase, alkaline 
phosphatase, malate dehydrogenase and the like. Where not commercially available, 
such antibody-enzyme conjugates are readily produced by techniques known to those 
skilled in the art. ' Further exemplary detectable labels include biotin, which binds with 
10 high affinity to avidin or streptavidin; fiuorochromes (e.g., phycobiliproteins, 
phycoerythrin and allophycocyanins; fluorescein and Texas red), which can be used 
with a fluorescence activated cell sorter; haptens; and the like. Preferably, the 
detectable label allows for direct measurement in a plate luminometer, e.g., biotin. Such 
labeled antibodies can be used in techniques known in the art to detect proteins of the 

15 invention. 

Assessing F ibre Properties 

Fibres produced from plants of the invention are compared to control fibres (e.g., 
fibres from wild-type plants or plants transformed with marker nucleic acids) to 
20 determine the extent of modulation of fibre properties. Modulation of fibre properties, 
such as fibre number, length, strength, or fineness, is achieved when the percent 
difference in these fibre properties of the plants of the invention and control plants is at 
least about 10%, preferably at least about 20%, most preferably at least about 30%. 

Several parameters can be measured to compare the properties or quality of 
25 fibres produced from plants of the invention to wild-type plants. These include: 1) fibre 
number, 2) fibre length; 3) fibre strength; and 4) fineness of fibres. 

A number of methods are known in the art to measure these parameters, such as 
described in U.S. 5,495,070. For example, instruments such as a fibrograph and HVI 
(high volume instrumentation) systems can be used to measure the length of fibres. The 
30 HVI systems can also be used to measure fibre strength. Fibre strength generally refers 
to the force required to break a bundle of fibres or a single fibre. In HVI testing, the 
breaking force is expressed in terms of "grams force per tex unit." This is the force 
required to break a bundle of fibres that is one tex unit in size. In addition, fineness of 
fibres can be measured,, e.g., from a porous air flow test. In a porous air flow test, a 
35 weighed sample of fibres is compressed to a given volume and controlled air fl6w is 
passed through the sample. The resistance to the air flow is read as micronaire units. 
More specifically, the micronaire value is a measurement of cotton fibre quality that is a 
reflection of both fineness and maturity; low values indicate fine and/or immature fibre; 
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high values indicate coarse and/or mature fibres. These values are determined 
according to standard techniques by measuring the resistance offered by a plug of cotton 
to airflow (supra) that is influenced by a combination of fineness and maturity. 
Fineness is the outside diameter of the fibre that is measured in mTex (Millitex or 

5 mg/km). Maturity is the degree of wall thickening of the fibre. Short fibre count (w) % 
refers to the percentage of short fibre weight; short fibre count (n) % refers to the 
percentage of short fibre yield. Immature fibre count refers to the number of immature 
fibres, i.e., fibres in which the thickening of the fibre wall is appreciably less than 
normal. An increase in fibre yield (fibre weight/seed) can be measured by using the 

10 Advanced Fibre Information System (AFIS). Using these and other methods known in 
the art, one of skill can readily determine the extent of modulation of fibre 
characteristics, quality and/or yield plants produced by the methods of the invention. 

EXAMPLES 

15 yvamnle 1 - f-nmnarison nf fibre initi a ls development in wild-type and lintless 
mutants of cotton. 

To identify genes that were specific to fibre initiation, genes were identified that 
were differentially expressed in early stage fertilised ovules of wild-type and lintless 
mutants of cotton that produce little if any fibres. The cotton lines (Gossypium hirsutum 
20 L.) used included two wild type cotton lines: Deltapine 16 (DP16) and Xu-142, and 6 
lintless lines: Lintless 1A, Lintless 4A, Lintless 5B, Lintless 53, SL1-7-1 andfl. The 
1A, 4A, 5B, 53 lines were obtained from the Queensland Department of Primary 
Industry' Tropical Crops and Pastures Germplasm Collection and were originally 
selections from a linted cultivar B1278 isolated by Dr Alistair Low (unpublished, 
25 CSIRO Irrigation Research, Griffith, NSW). SL1-7-1 was obtained from USDA-ARS 
(College Station, Texas, USA). The Fuzzless-lintless, fl, isolated from Xu-142 
background, and Xu-142 were provided by Prof. Xiao-Ya Chen (Institute of Plant 
Physiology, Chinese Academy of Science, Shanghai, China). All the cotton lines were 
grown in a glasshouse with temperature of 30 °C /22 °C (day/night). Ovules were 
30 always collected at a similar time each day (1-3 pm) and samples to be compared on a 
microarray were only used when they were collected on the same date and from the 
same glasshouse to minimise between time or location variability. 

' The mutants were of varying provenance but it is not known whether they are 
allelic. The four lines Lintless 1 A, Lintless 4A, Lintless 5B, Lintless 53 were originally 
35 separate selections from a fully linted cultivar B1278 as spontaneous mutants that 
showed low but differing levels of lint production isolated as part of a breeding program 
to produce a cultivar with high quality seed oil and protein but with only a small amount 
of lint to retain the seeds in the boll capsule (Alistair Low, unpublished). Genetic 
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complementation testing has not yet been carried out. SL 1-7-1 (Mississipi Obsolete 
Collection Number 0504) was also a naturally occurring variant (Turley and Ferguson, 
1996) that produced less lint than the most lintless of the B1278 selections. All the 
mutant lines, but particularly 5B which had the most lint of any of the lines, show a 
5 variable, but low level of leakiness and this may be a result of environmental or 
physiological influences on lint production. At maturity the seeds of all of the mutant 
lines exhibit a fuzz-less phenotype (ie they have a completely naked seed and lack the 
short fuzz fibres covering the ovules of most Upland cotton varieties, including B1278). 
The growth rate and general vegetative and floral development of the mutants were 
10 similar to the wild type except that lintless 4 A was slightly slimmer, and taller (about 
20%) than the Deltapine 16 (DP16) variety used as the wildtype, whereas SL 1-7-1 (SL) 
exhibited higher levels of red anthocyanin pigments in all parts of the plant. All lines 
produced normal amounts of trichomes on their stems and leaves. 

Scanning electron microscopy was used to examine the fibre development of the 
15 5 lintless cotton mutants (Lintless 1A, 4A, 5B, 53 and SL) at two days before anthesis (- 
2 dpa), the day of anthesis (0 dpa) and two days after anthesis (2 dpa). Cotton ovules 
were collected using the phyllotactic arrangement of cotton flowering nodes and size of 
cotton flower buds as indicator of the development stage as described by Hasenfiratz et 
al., (1995). The collected ovules were observed using an Oxford CT 1500 cryotrans 
20 system attached to a JEOL 6400 scanning electron microscope as described by Craig 
and Beaton (1996). There was generally no obvious difference between the mutants and 
the wild type (DPI 6) at -2 dpa except that 4A ovules were covered with mucous-like 
substances. The ovule surfaces were flat and epidermal cells were interspersed with 
stomata. On the day of anthesis, a few fibre initials became visible on the mutant ovules 
25 but these were considerably fewer in number than on wild type ovules. The mucous- 
like substances covering 4A ovule disappeared at this stage. The mutant phenotype was 
best revealed at two days post-anthesis when the fibres have begun to elongate. At this 
time, all of the mutants had a much reduced number of fibre initials on the surface of 
ovules and those fibres that had developed were slower to elongate and less 
30 synchronized in their elongation than the fibre initials of the wild type. The degree of 
lintlessness varied among the mutants with some lines showing very few fibre initials 
(Lintless 4A, 1A and 53) and the others more (Lintless 5B and SL-1-7-1), but still 
considerably fewer than DPI 6. Early fibre growth was sensitive to environmental 
conditions demonstrated by the more rapid fibre growth of the wild type cotton grown in 
35 glasshouses with a temperature regime of 30°C/22°C (day/night) than in a glasshouse 
with temperature of 25°C/1 5°C. 
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Fxamnle 2 - Differential expressio n of genes ii» tht» mutant ovules compared to 
wild-type. 

cDNA Library Construction 

CHX cDNA library was constructed using cycloheximide treated ovules. Cotton 
flower buds of developmental stages of -3 dpa, -2 dpa, -1 dpa and 0 dpa were detached 
and pooled from glasshouse grown cotton plants and surface sterilized by dipping in 
70% ethanol and flaming twice. The cotton ovules were dissected out under sterile 
conditions and cultured on 15 ml of cotton ovule culture medium (Beasley and Ting, 
1973) supplemented with 5 pM IAA (indole acetic acid) and 1 uM GA in 100 ml glass 
flasks at 29 °C in the dark overnight and then treated with 10 pM cycloheximide for 4 
hours under the same culture conditions. After the cycloheximide treatment, the ovules 
were rinsed with sterile water, stored in RNAlater solution (Ambion) at -20 °C Total 
RNA was isolated using a method described by Wu et al. (2002). Purification of poly A + 
mRNA from total RNA was carried out using Qiagen Oligotex mRNA kit (Cat. No. 
70042), following the manufacturer's protocol. In vitro translation was carried out to 
verify the bioactivity of the poly A + RNA (Wu et al. 2002). 5 ug poly A + mRNA from 
cycloheximide treated ovules was used for cDNA synthesis using a Life Technologies' 
Superscript Choice system (Cat. Series .18090) following the manufacturer's 
instructions. The first strand cDNA was synthesized using a mix of 1 ug OHgo(dT)12- 
18 primer and 50 ng of random hexamers. The EcoW adapted cDNA was size^ 
fractionated and cDNA longer than 500 bp was randomly clonedrin ^ZipLox EcoRI 
arms (Life Technology Cat No. 15397-029). This library was comprised of 2 x 106 
primary pfu with average insert size of 1 .05 kb. 

OCF cDNA library was developed from DPI 6 0 dpa ovules. The total RNA 
isolation and poly A + mRNA purification was the same as for CHX libraryconstruction. 
5 pg poly A + mRNA was used for cDNA synthesis and cDNA library construction using 
a Life Technologies' Superscript lamda system (Cat. No. 19643-014) following the 
manufacturer's instructions. The first strand cDNA was synthesized using the NotI 
primer-adapter, in the presence of 1 pCi [ct- 32 P] dCTP. The Sail adapted and NotI 
digested cDNA was size-fractionated and the cDNA longer than 500 bp was 
directionally cloned in ^ZipLox Nod- Sail arms (Life Technology Cat No. 15397-029). 
This primary library was comprised of 5 x 10 5 pfu with average insert size of 0.9 kb. 

For both CHX and OCF libraries, the cDNA could be recovered in the 
autonomously-replicating plasmid pZLl using a in vivo excision protocol provided by 
Life Technology. Two bacterial strains (Life Technology) were used for the excision: 
DH10B (ZIP) for the preparation of double stranded plasmid DNA, and DH12S (ZIP) 
for the preparation of single stranded DNA by infection with helper phage M13K07 
(NEW ENGLAND BioLabs). 
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Normalization of the OCF library 

The OCF library was normalised using a method essentially as described by 
Bonaldo et al. (1996) with some modifications. Single-stranded plasmid DNA (ssDNA) 
was prepared from the excised OCF library using helper phage M13K07 (NEW 
ENGLAND BioLabs) and purified using the Bio-Gel HTP hydroxyapatite (HAP) 
column (Bio-Rad) according to Ali et al. (2000) with some modifications. The ssDNA 
was loaded on to a jacketed HAP column (Bio-Rad) at 60 °C and washed with 3 ml 10 
mM Na-phosphate. The column was washed with 3ml 0.1 6M Na-phosphate buffer and 
the eluate was collected in small fractions (about) 200 pi) and OD 260 was measured for 
> each of the fractions. The fractions with the highest OD measurements were pooled and 
desalted using a Qiaquick PCR purification kit (Qiagen). The driver DNA was prepared 
from the same excised OCF library by Sail and NotI digestion of the double-stranded 
plasmid and the gel-separated cDNA fragments (smear on the gel) was purified using a 
Qiaquick gel extraction kit (Qiagen). Hybridization of driver with ssDNA tracer was 
5 performed in the presence of a 5'-blocking oligo (5'-CCCACGCGTCCG-3') (SEQ ID 
NO:71), and a 3'-blocking oligo (5'-AAAAAAGGGCGGCC-3 ') (SEQ ID NQ:72). The 
hybridization mix comprised 0.5 M NaCl, 10 mM Tris (pH 8.0), 1 mM EDTA, 2 pg 
double-stranded driver DNA, 0.2 pg ssDNA, 10 pg of each of the blocking oligos, and 
25 pi formamide in a total volume of 50 pi. The driver DNA was heat denatured by 
20 boiling for 5 min prior to mixing with the rest of the components and the whole 
hybridisation mix was heated to 65°C for 3 min and then incubated at 42 °C for 24 
hours. The hybridisation mix was run through a HAP column under the same conditions 
as described for ssDNA purification to separate the ssDNA from the hybridised DNA. 
The conversion of purified ssDNA into double-stranded plasmid DNA and 
25 transformation of competent DH10B (Life Technology) was as described by Ali et al. 
(2000). The. normalized library was designated the ON library consisting 10 s primary 
transformants with average insert size of 0.9 kb. 

PCR amplification ofcDNA clones and microarray preparation 

30 The cotton ovule cDNA microarray comprises a total of 10410 cDNA clones. 

Except for 52 clones encoding known cotton genes provided by colleagues, and 13 
clones of negative controls (non-plant genes, intron sequences etc.), me rest of the 
clones were randomly picked from the CHX, OCF and ON cDNA libraries, including 
5496 clones from CHX library, 1 149 clones from OCF library and 3700 clones from the 

35 ON library. 

All the anonymous cDNA clones from the ovule cDNA libraries were PCR 
amplified in 96 well PCR plates (AB gene). The PCR reaction contained of 2 mM 
MgCb, 0.2 mM each of the dNTPs, 0.2 pM each of the M13/pUC forward and reverse 
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primers, 1 unit Taq F2 DNA polymerase and lx F2 buffer (BIOTECH International 
Limited) in a 50 pi reaction with 2 pi overnight cultured bacterial cells as template. A 
row of 12 samples from each PCR plate was verified by agarose gel electrophoresis. 
The PCR fragments were ethanol precipitated and resuspended in 10 pi 50% DMSO and 
8 pi of the fragments was transferred to 384 well plates for microarray slide printing. 
The PCR fragments were arrayed onto CMT-GAPS coated microarray slides (Corning) 
using a Virtek ChipWriter Pro (Virtek Biotech) arrayer. Post-printing slide processing 
was performed by baking the slides at 80 °C for 3 hours as described in the 
manufacturer's technical manual. 



Micoarray analysis 

Cotton ovules used for RNA isolation were kept in RNAlater solution (Ambion) 
at 4°C overnight and then stored at -20°C. For separating the ovule outer integument 
from the inner tissues, ovules stored in RNAlater were used and the separation was 
15 performed under a microscope at room temperature. Total RNA isolations were 
performed using a method described by Wu et al., (2002). Purification of poly A + 
mRNA from total RNA was carried out using Oligotex mRNA kit (Qiagen, Cat. No. 
70042), following the manufacturer's protocol. The cotton ovule cDNA microarray 
comprises a total of 10410 cDNA clones. Except for 52 clones encoding known cotton 
20 genes, the clones were randomly picked from.cDNA libraries constructed from DP-16 
ovules of -3 dpa to 0 dpa. 

For microarray probe labelling, equal amounts of mRNA (0.5-1 pg) of two 
compared samples were reverse transcribed using Superscript II reverse transcriptase 
(Life Technologies), using a combination of 1 pg oligo(dT)12-18 primer and 6 pg 
25 random primers (Life Technologies) per reaction. The purification and Cy3-dUTP and 
Cy5-dUTP (Amersham Pharmacia Biotech) labelling of the first strand cDNA was 
•essentially as described by Schenk et al., (2000). The labelled probes were combined 
and purified using a Qiaquick PCR purification kit. The conditions for slide 
hybridisation and washing were as described in the manufacturer's instruction manual 
30 (Corning, CMT-GAPS coated slides). The microarray images were scanned using a 
GenePix 4000A microarray scanner (Axon Instruments, Union CA, USA). A typical 
microarray comparison consisted of 4 replicates unless otherwise specified. This 
included two biological replicates and each biological replication contained two dye- 
swapped hybridisations. In a time course comparison where RNA from DP16 ovules of 
35 -4 dpa, -2 dpa and +2 dpa were compared to RNA from 0 dpa DP 16 ovules, most 
comparisons consisted of 4 replications as mentioned above, however, self-comparison 
of 0 dpa to 0 dpa comprised 3 biological replications. 
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Scanned microarray images were analysed using the GenePix Pro program 
(Axon Instruments, Union CA, USA). Grids were predefined and manually adjusted to 
ensure optimal spot recognition and bad spots, eg. dust contamination etc., were 
flagged. Spots were quantified using the GenePix's fixed circle method, and medians of 
the fluorescence intensity of the red and green channels were used to calculate the ratio 
of the two channels. The data were log 2 transformed and normalised using a spatial 
normalization method described by Wilson et al., (2003). The data were then rescaled 
by dividing by an estimate of the median absolute deviation (Wilson et al., 2003) before 
running the "find differentially expressed gene" function of tRMA (tools for R 
Microarray Analysis available via http://www.pi.csiro.au/gena/). For a typical 
microarray comparison that consisted of 4 replications, the "find differentially expressed 
gene" function of tRMA was used to select differentially expressed genes from each of 
the replications separately, and the gene lists were then compared and genes occurring 
in at least 3 of 4 replications were classified as differentially expressed genes for this 
comparison. 

Gene expression in 0 dpa whole ovules of each of the mutants was. compared to 
the wild-type DP16 at the same stage using microarray analysis as described above to 
identify differentially expressed genes. The number of genes identified are shown in 
Table 2. Each experiment was replicated a number of times as both biological replicates 
and dye swap replicates. The total number of cDNA clones that are differentially 
expressed amongst the mutants compared to the wild-type varied significantly, from an 
average of 60 clones in the Lintless 4A/DP16 comparison up to an average of 243 
clones in the Lintless 53/DP16 comparison. In addition, the proportion of cDNA clones 
that are up or down-regulated also varied amongst the mutants with 4 mutant lines (1 A, 
53, 5B and SL) showing a higher number of cDNA clones that were up-regulated than 
down-regulated, and vise versa for the mutant Lintless 4A. These results may reflect the 
diverse genetic backgrounds or different genetic lesions of the mutants in addition to the 
variable amounts of lint produced by each line. 



TABLE 2. Number of cDNA clones that are up- or down-regulated in each of the 
mutants as compared to the wild type 



Comparison 


Genes 

Differentially 
Expressed 


Genes 
Up in 
Mutant 


Genes 
Down in 
Mutant 


Number of 
Replicates* 


Minimal 

Reproducibility* * 


4A/DP16 


60 


9 (15%) 


51 (85%) 


8 


75% 


1 A/DPI 6 


67 


48 (72%) 


19 (28%) 


8 


75% 


SL/DP16 


102 


88 (86%) 


14 (14%) 


6 


83% 


5B/DP16 


144 


91 (63%) 


53 (37%) 


4 


75% 


53/DP16 


243 


199 (82%) 


44(18) 


4 


75% 
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* Replicates consist both biological and technical replications. Each biological replicate 
(a RNA isolation) comprises two dye-swaped technical replicates. Number of 
biological replicates = Total number of replicates/2. 

** Genes identified as being significantly differentially expressed in at least 6 out of 8 
5 or 3 out of 4 replicates (75%), or 5 out of 6 replicates (83%). 

Example 3 - Identification of genes that are differe ntially expressed in seed coat 
outer integument of lintless mutants. 

Since the collected embryos described above may already have been pollinated 

10 and zygote development initiated, a separate microarray comparison was made between 
the mRNAs of the outer integument and those of the inner ovule tissues of the wild-type 
cotton, to filter out those genes that were not expressed specifically in the seed coat 
outer integument where fibres are initiated. Cotton ovules at 0 dpa are rapidly 
developing complex organs, composed of at least three separable layers of tissues: the 

15 outer integument, the inner integument and the nucellus (including a developing 
zygote). The genes identified as being differentially expressed in the mutant/wild-type 
comparisons might be constitutively expressed throughout the whole ovule or they may 
be expressed in only one or two. of the layers. As cotton fibres develop only from the 
epidermal cells of the outer integument, genes that showed a higher expression level in 

20 this layer should be more relevant to fibre initiation and development compared to the 
genes that are predominantly expressed in the inner integument and nucellus. Outer 
integuments were therefore separated from the inner integuments and nucellus of 0 dpa 
wild type ovules by microdissection and labeled cDNA prepared from the partitioned 
tissues as described above. The gene expression in the outer integument was then 

25 compared to that of the inner integument and nucellus by probing the ovule cDNA 
microarray. The results, averaged over four replicates (two biological replicates each 
consisting of two dye-swapped technical replicates) revealed a total of 120 cDNA 
clones that were differentially expressed with 65 clones up-regulated and 55 clones 
down-regulated in the outer integument of wild type ovules. The list of 65 outer 

30 integument up-regulated clones was then used as a filter on the differentially expressed 
gene lists identified from the lintless mutantfwild type comparisons to select for cDNA 
clones that were up-regulated in the outer integument. This filtering resulted in the 
identification of a surprisingly small number of genes: 4, 7, 6, 10, 4 genes from the 1A, 
4A, 5B, 53 and SL mutants respectively that were both differentially expressed in 
35 mutant/wild type comparisons and up-regulated in the outer integument. 

There was significant overlap of the cDNA clones amongst the different mutants 
and in total only 11 unique cDNA clones were identified from this experiment as 
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potential candidate genes involved in early stage fibre development. The changes in 
relative expression in each mutant are summarized in Table 3. 

TABLE 3. Genes that were up- or down-regulated in the lintless mutants and up- 
5 regulated in the outer integument 1 



Name 


Clone No 


SL/ 
DP 


1AA 
DP 


53/ 
DP 


4A/ 
DP 


5B/ 
DP 


OI/I 
I c 


Most Homologous 
gene d 


GhMyb25 


ON035F4 


-5.8 


-10.1 


10.8 


-10.8 


10.8 


4.0 


GHMYB25 


GhFaEl 


ON035N 
9 


-5.6 


-6.7 


-5.7 


-7 


-5.8 


4.7 


(NP_195909) 
Transferase 


GhFUl 


ON035C 
9 


-8.5 


-11 


14.1 


-15.2 


13.8 


J.. 1 


unKnown 


a- 

Expansin 


Pfsl4x 


-6.8 


-7.6 


-9.1 


/ 


-9.1 


3.3 


Alpha-expansin 


GhFU2 


ON003F1 


/ 


/ 


-5.9 


-5.6 


-8.2 


2.8 


Unknown 


GhHDl 


ON033M 
7 


/ 


/ 


/ 


-5.5 


-4.9 


2.8 


(T05850) 

HomeoDox protein 
ATML1 


GhTMTP. 


CHX015 
K18 


/ 


/ 


/ 


-8.1 


-4.4 


4.7 


(NJr_l /dz>d / ) Air- 
dependent 
transmembrane 
transporter 


GhCycD3 
:1 


OCF07F4 


/ 


/ 


/ 


.-5.3 


-3.7 


4.9 


(AAQ19972) 
Cyclin D3 


CHX007 
D10 b 


CHX007 
D10 


/ 


/ 


3.2 


/ 


/ 


4.4 


(AC084282) 
Putative protein 
phosphotase/(BAB 
83948) CIG1 


GhSus 


CHX002 
CIO 


/ 


/ 


/ 


/ 


-3.5 


4.6 


(AAD28641) 
Sucrose synthase 


GhLTP 


ON033M 
19 


/ 


/ 


/ 


/ 


-5.9 


3.2 


(AAM62634) Lipid 
transfer protein 



a The values presented in the table are the medians of Log 2 transformed, normalized and 
rescaled ratios of the two compared samples. The rescaling was performed by dividing 
10 through by an estimate of the median absolute deviation (Wilson et al., 2003). 
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b Most probably a chimeric clone. 

c OI: Outer Integument; II: Inner Integument and Nucellus. 

d Most homologous gene based on the top BlastX identity score, with Genbank 
Accession No. and putative biochemical function. 

Except for clone CHX007D10, which appears to be a chimeric clone, that was 
up-regulated in mutant Lintless 53, the rest of the clones were all down-regulated in the 
mutants. Three genes (corresponding to clones ON035F4, ON035N9, ON035C9) were 
down-regulated in all 5 mutants and the other clones are down-regulated in up to 4 of 
the mutants. Sequence analysis showed a range of genes that had not been identified or 
characterized previously as important in early fibre development, including two 
transcription factors (GhMyb25 and a gene encoding a putative homeodomain protein); 
a cyclin D3 homolog; a transferase protein; a transmembrane transporter and two genes 
of unknown biochemical function. For ease of referring to the different genes, we have 
assigned them gene names that refers to their presumed functions, such as GhHDl to 
refer to the cotton homeodomain protein like gene represented by ON035N9 (Table 3). 

Example 4 - F.vnression natterns o f penes in wild-type developing ovules. 

RNA from DP 16 ovules of -4 dpa, -2 dpa and +2 dpa was compared to RNA 
from 0 dpa DP 16 ovules using microarrays to profile the temporal changes in expression 
of genes.around the time of fibre initiation (DP16 time course). RNA from -2 dpa and 
+2 dpa ovules from the Lintless 4A mutant, which shows more severe lintless 
phenotypes amongst the B1278 mutants, was compared with RNA from DP ovules of 
corresponding stages to reveal the temporal profiles of the genes inhibited in mutant 4A 
(4 A/DP multi-time point comparison). The results of these experiments for the 
identified candidate genes are shown in Fig. 1. 

The DP16 time course showed two classes of expression profiles: Class I genes 
showed peak expression at 0 dpa; while Class II genes exhibited increased expression 
towards +2 dpa (Fig. 1. Column A). Three genes, GhMyb25, the GhHDl and 
GhCycD3;l, had a Class I expression profile suggesting a role in the early events of 
fibre initiation at anthesis. The expression of GhCycD3;l increased continually from -4 
dpa to 0 dpa and plateaued between 0 dpa and +2 dpa, while GhMyb25 and the GhHDl 
exhibit a dip in expression at -2 dpa followed by a peak at 0 dpa and then a decline 
towards +2 dpa. The peak expression at 0 dpa of these three genes coincided with the 
time of fibre initiation. The rest of the genes all showed a Class II expression pattern 
although the specific details differed among them. The expression of GhFUl and 
GhSus (sucrose synthase) increased gradually in the time period examined. GhFaEl 
(transferase family) and the expression of GhTMTP (transmembrane transporter) 
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showed a slight decrease from -4 dpa to -2 dpa and then increased gradually towards +2 
dpa. The third group from this class comprised a-Expansin and GhLTP (lipid transfer 
protein) which show a distinctly flat profile from -4 dpa to 0 dpa followed by a sharp 
increase from 0 dpa to +2 dpa. The last member from this class, GhFU2, exhibited 
increased expression from -4 dpa to -2 dpa and again from 0 dpa to +2 dpa, while the 
expression between -2 dpa to 0 dpa remained unchanged. 

The multiple time point comparisons of 4 A/DP 1 6 revealed the time and duration 
of up- or down-regulation of the genes in the 4 A mutant relative to DP 16 (Fig. 1. 
Column B). Among the Class I genes, none showed significant repression relative to 
DPI 6 at -2 dpa in 4A ovules, instead, the repression started after -2 dpa and reached the 
lowest level at 0 dpa for the GhHDl and GhCycD3;l genes, while GhMyb25 continued 
to decrease slightly after 0 dpa. Class II genes, in a similar fashion, did not show any 
significant repression at -2 dpa, and repression started after -2 dpa for most of the genes 
except GhSus, which only showed a later repression after 0 dpa. All the Class II genes 
exhibited repression at about +2 dpa. Three Class II genes, GhSus, a-Expansin and 
GhLTP, were not identified as differentially expressed genes in the initial 0 dpa 4 A/DP 
comparisons. It became clear from this experiment that the repression of GhSus did not 
occur until after 0 dpa and only became significant at +2 dpa, while the repression of a- 
Expansin and GhLTP began after -2 dpa only became highly significant towards +2 dpa 
when fibres were rapidly elongating in the wild-type. 

Example 5 - Cnnfirmatinn of down -regulation nf fibre initiation genes. 

The genetic background of the fibre mutants (B1278 and SL) used herein was not 
identical to that of the DPI 6 control used in the comparisons. Neither parental genotype 
was available for use in the experiments described above. While the differential 
expression of genes observed between the mutants and the wild-type might have arisen 
due to differences in the genetic backgrounds of the plants, the commonality of the 
genes identified among the different mutants and the fact that some have been 
characterized previously as important for fibre development using different strategies, 
suggested otherwise. A Juzzless-lintless (Jt) mutant had recently been isolated from the 
Chinese G. hirsutum cultivar Xu-142 and used to identify and characterize fibre 
development related genes (Yu et al., 2000, Li et al., 2002, Ji et al., 2003). These two 
lines provided an isogenic pair with which to validate the genes identified from the other 
mutants. RNA from 0 dpa ovules of fl was compared to that of 0 dpa ovules of Xu-142 
and 119 clones were identified as differentially expressed in four replicates. The same 
outer/inner integument gene expression filter as described above was applied to the data 
set and identified 13 differentially expressed genes that were also up-regulated in the 
outer integument of DP16 ovules. Amongst the 13 genes, 8 were in common with genes 
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identified in the other lintless mutant/DP16 comparisons and the results are presented in 
Table 4. This comparison also revealed 5 additional genes that had not been identified 
in the previous 5 mutants. One of the cDNA clones, ON038N8 (886 bp), encodes a Myb 
protein which was 69% identical to the GhMyb25 protein. Other genes include two 
5 different RD22 genes, a second LTP and a putative L-asparaginase. 



TABLE 4 . Genes differentially expressed in ft as compared to Xu-142 



IN dHIC 


Clone No 1 


fl/Xu-142** 1 


VIost Homologous gene*** 


q-Expansin* 


Pfsl4x 


-11.5 


[AF512539) Alpha-expansin 


OrLr U 1 




-9.8 


Unknown 


GhMyb25* 


ON035F4 


-9.0 


fAF336283) GHMYB25 


GhFU2* 


ON003F1 


-7.8 


Unknown 


GhFaEl* 


ON035N9 


-7.4 


(NP 195909) Transferase 


GhHDl* 


ON033M7 


-6.1 


(T05850) Homeobox protein ATML1 


GhLTP* 


ON033M19 


-4.9 


( AAM62634) Lipid transfer protein 


GhTMTP* - 


CHX015K18 


-3.7 


(NP_1 75557) ATP-dependent 
transmembrane transporter 


GhMyb25-like 


ON038N8 


-4.4 


( AF336283) GHMYB25 


GhRD22 


OCF005C10 


-4.4 


(AAL67991). Dehydration-induced 
protein RD22 


GhAsp. 


OCF008G9 


-3.4 


CBAC66615) L-aspraginase 


GhLTP2 


OCF010D8 


3.4 


(CAA65477) Non-specific lipid-transfei 
protein 


GhRD22-like 


OCF006C1 


5.3 


(BAC22498) Resistant specific protein-1 



10 * Genes in common with the candidate genes from the other 5 mutants. 

** The values presented in the table are the medians of Log 2 transformed, normalized 
and rescaled ratios of the two compared samples. The rescaling was performed by 
dividing through by an estimate of the median absolute deviation (computed on the final 
residual mean-difference data) as described by Wilson et al., (2003). 

1 5 *** Based on the top BlastX hit. 

F.Wam ple 6 - Characterisation of GhM vb25 expression. 

The nucleotide sequence of the clone ON035F4 was obtained. It was 1160 
nucleotides in length with a coding region from nucleotides 68 to 995, encoding a 
20 protein which was 98% identical at the amino acid level to GhMyb25 (AF336283), 
expressed in 0 dpa ovules of G. hirsutum cultivar Acala Maxxa (Benjamin Burr, in 
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Genbank). The encoded protein was an R2R3 type of Myb transcription factor. It was 
also 96% identical to the G. arboretum EST (BE054276), suggesting that it was from 
the A-genome present in tetraploid cotton. Outside the R2R3 region, which is highly 
conserved amongst all Myb transcription factors, GhMyb25 showed highest homology 
5 to the Petunia hybrida MYB.Ph3 and Antirrhinum majus MIXTA (AmMIXTA) than to 
Arabidopsis GL1 and cotton MYB A, another cotton myb which caused distinct 
abnormalities when over expressed in transgenic tobacco including the production of 
cotyledonary trichomes (Payne et al., 1999). The sequence of GhMyb25 currently in 
Genbank (AF336283) contained an unspliced intron that was not present in our clone 
10 (nucleotides 201-282 of AF336283). 

Reverse txanscription-polymerase chain reaction (RT-PCR) assays were used to 
analyse expression of the gene as follows. Total RNA samples isolated from cotton 
tissues were DNase (RQ1 RNase-free DNase, Promega) treated and 0.5 jxg of the total 
RNA was used in a RT-PCR reaction. The first strand cDNA synthesis was performed 
1 5 using Superscript II reverse transcriptase and buffer supplied by the manufacturer (Life 
Technologies). The RT-PCR reaction was performed essentially as described by 
McFadden et al., (2001). The cotton p-tubulin gene was used as a control in all the RT- 
PCR reactions. The forward and reverse primers used for p-tubulin are 5'- 
AGAAC ATGATGTGTGCTGC-3 ' (SEQ ID NO:65) and 5'- 
20 AGCTGTGAACTGCTC ACTC-3 ' (SEQ ID NO:66) respectively and the resulting 
cDNA fragment was 300 bp. The forward and reverse primers used for GhMyb25 RT- 
PCR were: 5'-TCAAACCCTCCTCAAAGCAACC-3 , (SEQ ID NO:67) and 5'- 
ATTCC ATTACC AGACGATGATGAC-3 ' (SEQ ID NO:68) respectively and this 
produced a cDNA fragment of 224bp. The GhMyb25 and p-tubulin RT-PCR reactions 
25 were performed in a one-tube reaction amplified with an initial denaturation cycle at 
95°C for 3 min followed by 23 cycles at 95°C for 15 sec, 55 °C for 15 sec, 72 °C for 1 
min and with a final cycle of 72°C for 2 min. 5 jLtl of the RT-PCR reaction was checked 
on a 2% agarose gel and the gel was Southern-blotted to Hybond-N* membrane 
(Amersham Pharmacia Biotech) and hybridized with 32 P- labelled probe derived from 
30 the cDNA clone of GhMyb25 . 

GhMyb25 expression was detected in 0 dpa wild-type ovules and not in petal, 
leaf and stem using RT-PCR (Fig. 2, panel a). GhMyb25 expression was detected in -2, 
0, 2 and 5 dpa ovules with highest expression in 0 and 2 dpa ovules (Fig. 2, panel b). 
No expression of GhMyb25 was visible in ovules of -2, 0 and 2 dpa from three of the 
35 lintless mutants, 1 A, SL1-7-1 and 4A (Fig. 2, panel c) although very low expression in 
the mutant ovules was revealed after hybridizing RT-PCR products with an ON035F4 
probe (Fig. 2, panel d), consistent with the low levels of lint production that still occurs 
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on these ovules. GhMyb25 expression peaked at 0 dpa in lines 1 A and SL1-7-1, whereas 
in 4A, the highest expression was detected slightly later at 2 dpa. 

Example 7 - Characterisation of the homeodo main protein gene GhHDl. 

5 ON033M7, a partial cDNA clone of 442 nucleotides, contained a small region at 

the 3'end of its encoded protein with homology to two homeodomain proteins including 
protodermal factor 2 (NP_567274) (26/28 identical amino acids) and the Ll-specific 
and ovule specific homeodomain gene ATML1 (T05850) (26/30, 86%). ATML1 has 
been grouped with the Arabidopsis GLABRA2 in the same HD-GL2 class and they also 

10 share a common LI layer-specific or dermal-specific pattern of expression (Lu et al., 
1 996). The cotton gene was designated GhHDl . 

RT-PCR experiments were carried out to analyse expression, in similar fashion 
to those described above. The forward and reverse primers used for the GhHDl RT- 
PCR were: 

15 5 ' -GCTTTCTCTTGG ATC AG-3 ' (SEQ ID NO:69) and 5'- 

C A AT AAC AC ATGAA ACC AG-3 ' (SEQ ID NO:70) respectively and these resulted in 
a cDNA fragment of. 384bp. The GhHDl and P-tubulin RT-PCR reactions were 
performed separately under the conditions described above. 10 pi of the RT-PCR 
reaction was electrophoresed on a 2% agarose gel and the gel was Southern-blotted to 

20 Hybond-N"" membrane (Amersham Pharmacia Biotech) and hybridized with 32 P- 
labelled probe derived from the cDNA clone of the putative homeodomain gene. Since 
the expression of the p-tubulin appeared to be variable in different cotton tissues, the 
quantification of the GhHDl expression using p-tubulin gene as a standard was only 
performed on the 0 dpa ovules of different cotton lines using an Image-Quant program 

25 (Molecular Dynamics). 

GhHDl was mainly expressed in ovules of various developmental stages and at 
much lower levels in leaves as revealed by RT-PCR (Fig. 3). The expression increased 
at -1 dpa and remained high till 2 dpa. Low expression levels were observed in 4 dpa 
ovules (with fibres attached), 8 dpa ovules (without fibres) and 8 dpa detached fibres. 

30 The expression levels of 0 dpa ovules of mutants 5B and 4A relative to DPI 6 0 dpa 
ovules after normalization with p-tubulin expression is shown in Fig. 3. The expression 
in the mutant ovules was slightly reduced in 5B and remained similar to wild-type in 4A 
ovules. The RT-PCR band was confirmed as GhHDl -specific by Southern blot 
hybridization using the ON033M7 cDNA fragment as probe. 
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Example 8 - Characterisation o f other genes. 

The characteristics of the genes identified in this study are summarized in Tables 



5 and 6. 
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Clone OCF07F4 encoded a cyclin D3, similar to AAQ 19972, 106/149 similar 
amino acids, 71%). 

Example 9 - Fibre cells undergo DNA endoreduplicatio n during initiation* 

5 The observation that a cyclin D3 gene encoded by clone OCF007F4 was down- 

regulated in 0 dpa ovules of mutants 4A and 5B compared to DP 16 prompted the 
inventors to investigate the cell division and DNA replication activities of the 
epidermal layers of DPI 6 and lintless 4A ovules. Ovules of cotton line DPI 6 and 4A at 
-2, -l,and 0 dpa stages were fixed in 3:1 (95% ethanol:acetic acid) for lhr at room 
10 temperature, cleared in 95%/lmM MgCl 2 ethanol over night at room temperature and 
rehydrated through an ethanol series to lOmMTris/lmM MgCl 2 according to 
Szymanski and Marks (1998). The ovules were stained in O.ljag/ml propidium iodide 
for 30 seconds and than distained and kept in lOmMTris/lmM MgCl 2 . 

Nuclear DNA content of ovule epidermal and fibre cells at the chalazal end were 
1 5 measured using a Leica SP2 confocal laser scanning microscope (Leica, Wetzler, 
Germany). At least 200 nuclei were measured from each sample which consisted of at 
least 3 ovules. Fluorescence at 600-740 nm was collected after excitation at 488 and 
543 nm using a 63 X NA 1.25 water-immersion lens. After optically sectioning 
through the ovule epidermis, the mean fluorescence intensity and dimensions of. 
20 epidermal and fibre cell nuclei was measured from the maximum projection of the 
optical stack. Total fluorescence of individual nuclei was calculated by multiplying 
nuclear area by average fluorescence. This value was converted to a ratio by 
normalising against total fluorescence of epidermal cell nuclei at telophase or anaphase 
(2C) within the same image. The normalized values were then used to construct 
25 histograms of epidermal and fibre cell nuclear DNA content. 

Since fibre cells are known to cease dividing after differentiation, the ovule 
epidermal cell division activities of DP16 and 4A-183 were examined at and before 
anthesis. Over the period examined (-2, -1 and 0 dpa), the extent of cell division in the 
ovule epidermis of lintless 4 A and DP 16 were not significantly different (paired t-test, 
30 P = 2.1%) although the division rate in 4A was slightly higher than in DP16 (Table 7). 
It appears that the ovule epidermis division rates are higher at -1 dpa for both DP 16 
and 4A, although the significance of this is unclear. 

35 
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TABLE 7. Cell division rates in ovule epidermal cells of DPI 6 and mutant 4A 



DPA 


DPI 6 


4A 


-2 dpa 


2.78% 


2.97% 


-1 dpa 


3.25% 


3.5% 


0 dpa 


2.69% 


2.84% 5 



Relative DNA contents of the epidermal cells and fibre cells were also 
measured, and normalized using DNA contents of nuclei at anaphase or telophase (2C) 
and the results are presented in Fig. 4. Since the pre-fibre initials and epidermal cell 

10 are visually indistinguishable at -2 and -1 dpa, the data for those times are presented as 
total epidermal cells. The results indicate that epidermal pavement cells of DPI 6 and 
Lintless 4A at these time points have a DNA content peak around 2 to 2.4C, while 
differentiated fibre initials of DP16 when clearly distinguishable at 0 dpa, have an 
increased DNA content with the majority of cells showing a DNA content between 2.8 

15 C and 5.2C. While this result clearly suggests that the majority of fibre cells undergo at 
least one round of DNA endoreduplication during initiation, the involvement of the 
cyclin t>3 gene in this process still needs to be verified. 

y^amnle 10 - Cloning of full-length cDNA sequences and genes encoding therefor 

20 At least two approaches can be used to determine the full length sequence of 

partial cDNA clones described herein. 

One method is to screen a cDNA library, such as the cotton DP-16 ovule -3 dpa 

to 0 dpa library described herein, with a radioactively labelled polynucleotide which 

comprises the known portion of the cDNA. Library screening is performed as 
25 described by Sambrook et al., (supra), or other techniques known to those of skill in the 

art. 

In another method, two primers of about 17 to about 20 nucleotides derived 
from both ends of the known partial sequence are synthesized and used to amplify the 
desired cDNA from a population of cDNA reverse transcribed with a poly-T 

30 comprising primer from mRNA obtained from, for example, cotton DP-16 ovule -3 dpa 
to 0 dpa. The polymerase chain reaction (PGR) is carried out under routine conditions, 
for instance, in 25 pi of reaction mixture with 0.5 jug of the above cDNA mixture. A 
convenient reaction mixture is 1.5-5 mM MgCl 2 , 0.01% (w/v) gelatin, 20 each of 
dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 0.25 Unit of Taq polymerase. 

35 Thirty five cycles of PGR (denaturation at 94°C for 1 min; annealing at 55°C for 1 min; 
elongation at 72°C for 1 min) are performed with a Perkin-Elmer Cetus automated 
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thermal cycler. The amplified product is analyzed by agarose gel electrophoresis and 
the DNA band with expected molecular weight (as determined by Northen blot 
analsysi) is excised and purified. The PGR product is verified to be the selected 
sequence by subcloning and sequencing the DNA product. 
5 Several other methods are available for the identification of the 5' or 3' ends of 

an mRNA sequence. These methods include but are not limited to, filter probing, clone 
enrichment using specific probes, and protocols similar or identical to 5' and 3' 
"RACE" protocols which are well known in the art. For instance, a method similar to 
5' RACE is available for generating the missing 5 1 end of a 'desired full-length 
10 transcript. Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a 
population of RNA presumably containing full-length gene RNA transcripts. A primer 
set containing a primer specific to the ligated RNA oligonucleotide and a primer 
specific to a known sequence of the gene of interest is used to PGR amplify the 5' 
portion of the desired full-length gene. This amplified product may then be sequenced 
5 and used to generate the full length cDNA. 

To clone the corresponding gene of a cDNA described herein, a cotton genomic 
library is made in a X vector and plaques obtained by plating 10 6 or more X infectious 
particles at high density in a suitable E. coli host. The plaques are transferred to nylon 
filters. A gene specific probe designed considering the polynucleotides provided herein 
20 is labelled with radioactive label and used to hybridise to the nylon filters. Plaques 
corresponding to spots of hybridisation are isolated and confirmed to be positive for 
desired sequence by second or third rounds of hybridisation. DNA sequencing of the 
gene segments in the X clones is carried out by standard methods to determine the full 
nucleotide sequence of the gene and the flanking upstream and downstream regions. 
25 

Example 11 - Production of transgenic cotton 

The coding sequence of a gene of the invention is operably linked to a 
subterranean clover stunt virus promoter (S7; WO 96/06932) and a 3' transcription 
termination and polyadenylation signal functional in plants. This chimeric gene is 
30 operably linked to a selectable marker gene and introduced into a T-DNA vector. 
Cotton plants are transformed using the Agrobacterium mediated transformation 
technique. Transgenic cotton lines are identified, fibre number, fibre length, fuzz fibre 
length, cellulose content, and dry weight of the lint is analyzed. 

35 
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Example 12 - Discussion 

The use of multiple mutant lines in the above mentioned expression studies 
enabled confirmation and complementation of the findings from one mutant to another 
and focus in on the most critical genes for fibre development. It also helps to smooth 
5 out "noise" contributed by the biological variability in fibre growth and the unknown 
and maybe diverse genetic backgrounds of some of the mutants. The comparison of 
gene expression profiles between the outer integument and the inner ovule tissues 
served as a filter, to eliminate those genes that are not expressed at higher levels in the 
outer integument and helped to focus on a small set of about 10 candidate genes. The 
10 wild type time course data reveal that the expression profiles of these candidate genes 
separated them into two classes with class I genes (GhMyb25, GhHDl and 
GhCycD3;l) showing peak expression at 0 dpa, coinciding with the time of fibre 
initiation; class II genes exhibiting increased expression at 2 dpa, suggesting a more 
important role in fibre elongation. 
15 ' By analogy with the regulatory genes involved in Arabidopsis leaf trichome 

development, it might have been expected to find a Myb transcription factor expressed 
in cotton fibre that was similar to GL1. GhMyb25 (and the GhMyb25-like gene), 
however, shows higher sequence similarity to the Petunia hybrida Myb.Ph3 and the 
Antirrhinum MIXTA than to GL1 or other cotton Mybs. Both Myb.Ph3 and MIXTA 
20 show petal epidermis-specific expression. Based on its expression pattern, it was 
speculated that the function of Myb.Ph3 was to regulate flavonoid biosynthesis (Solano 
et al., 1995), but this speculation has not been confirmed. The function of MIXTA has 
been revealed as a controlling factor for the conical shape of petal epidermal cells 
(Noda et al., 1994). Over-expression of MIXTA in transgenic tobacco lead to 
25 production of supernumerary trichomes on cotyledons, leaves and stems as well as 
novel production of conical cells on leaves (Payne et al., 1999). In contrast, GhMyb25 
is expressed only in the ovules and not in later stage fibres (or petals so is not a 
homolog of MIXTA). It had higher expression in outer integuments and the time of 
expression coincide with fibre initiation. Accordingly, GhMyb25 plays a role in fibre 
30 initiation. The fact that GhMyb25 is down-regulated in all the lintless mutants, 
including the new fl mutant, points to a role as a positive regulator of fibre initiation. 

Comparison of the fl mutant with its parental genotype identified in addition to 
the 8 genes common to other mutants, a second Myb transcription factor, a GhMyb25- 
like gene, containing a conserved region outside the R2R3 domain shared by all the 
35 MIXTA class of Mybs (Stracke et al., 2001). The GhMyb25-like gene is only 64% 
identical to probable A-genome derived GhMyb25 at the nucleotide level (69% 
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similarity at the amino acid level), suggesting it is unlikely to be the homoelogous D- 
genome partner of GhMyb25 present in tetraploid cotton. GhMyb25-like is expressed 
at low level in -4, -2 dpa ovules of DP 16 (0.3 relative to 0 dpa ovule of 1) and increased 
sharply to 1 at 0 dpa and remained at a similar level of 1 at 2 dpa. This expression 
profile indicates a role for GhMyb25-like in fibre initiation. Among the 5 additional 
genes identified in this analysis, there were two RD22 genes, consistent with the 
findings of Li et al., (2002) who' identified a RD22 gene showing fibre specific 
expression using the same lines. 

The putative homeodomain gene identified in this study has high similarity in 
part of its C-terminus to the LI specific and ovule specific homeodomain gene 
ATML1. ATML1 was classified in the same HD-GL2 class as Arabidopsis GL2 based 
on sequence homology and they share a common LI layer-specific or dermal-specific 
pattern of expression (Lu et al., 1996). ATML1 was proposed to be involved in setting 
up morphogenetic boundaries of positional information necessary for controlling cell 
15 specification and pattern formation based on gene expression patterns. GL2 that has 
been studied for its role in trichome, root-hair and seed coat development (Rene et al., 
1994, Cristina et al., 1996, Masucci et al., 1996). The GL2 mutations resulted in 
aborted trichomes with aberrant cell expansion whereas entopic expression noticeably 
increased the number of trichomes and induced clusters of trichome formation (Ohashi 
20 et al., 2002). The GhHDl gene is expressed in ovules with higher expression in outer 
integument, in fibres as well as in leaves and this expression pattern probably reflects a 
more general role in different epidermal cell specification and pattern formation similar 

to that shown by the GL2. 

DNA endoreduplication, a strategy to amplify nuclear DNA without cell 

25 division is a major mechanism leading to somatic polyploidisation in plants (reviewed 
' by Joubes and Chevalier 2000). Correlations have been established between 
polyploidy and bell differentiation and cell expansion. While it is well established that 
Arabidopsis trichomes undergo four rounds of endoreduplication during development, 
leading to branched cells with nuclei containing about 32C DNA (Schnittger et al., 

30 2002), it has been less than clear whether cotton fibre initials undergo a similar process. 
Berlin (1986) studied tritiated thymidine uptake by epidermal layer using in vitro 
cultured cotton ovules and observed that there was an increase in thymidine 
incorporation from -2 dpa to 1 dpa and then the incorporation declined and finally 
stopped at 6 dpa. These observations were interpreted as DNA synthesis in preparation 

35 for cell division. Since fibre initials do not undergo divisions and no thymidine 
incorporation was observed in the elongating fibres, the author suggested that gene 
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amplification did not occur during fibre development over the time observed (Berlin 
1986). Van't Hof (1998) reported that the DNA content of developing cotton fibre 
cells only increased by about 24% from 2 dpa to 5 dpa and suggested that during early 
stages of development fibre cell nuclei either selectively amplify certain sequences or 
5 enter S-phase replicating only a portion of their genome. Using laser-confocal 
microscopy and propidium iodide staining, we examined ovule epidermal cell division 
rates and DNA contents of epidermal cells and fibre cells. Our results show that the 
epidermal cell division rates remain relatively constant from -2 dpa to 0 dpa (with a 
small increase at -1 dpa). While nuclear DNA contents of epidermal cells remain 
10 largely unchanged from -2 dpa to 0 dpa, the fibre initials contain nuclei that mostly 
show higher than 2C DNA content with the majority of cells showing DNA contents 
between 2.8 C and 5.2C. While our results suggest the fibre initials undergo one round 
DNA endoreduplication starting at 0 dpa, they do not exclude the possibility of 
selective amplification of certain sequences or partial replication of the genome at later 
15 stages suggested by Van't Hof s work. In addition, the enlarged nuclei in fibre initials 
revealed by the ultrastructural studies (review by Berlin 1986) provided further support 
to the DNA amplification phenomenon. 

Although accumulating data reveal that DNA endoreduplication is 
developmental^ regulated, it is still poorly understood in plants (reviewed by Joubes 
20 and Chevalier 2000). Assuming the endoreduplication is a modified cell cycle, it may 
share common determinants with the classic cell cycle (Joubes and Chevalier 2000). 
The two main control points in the cell cycle are at the Gl/S and G2/M transitions and 
in most plant cell types, the primary control point probably operates during Gl phase. 
This period not only includes the point of commitment to cell division, but may also 
25 represent the time during which differentiation decisions are made (reviewed by Meijer 
and Murray 2000). Mammalian cyclin D-Cdk4 complexes have been characterized as 
growth factor-responsive cell cycle regulators operating during Gl phase. Cyclin D3 
was found to be present at high levels in megakaryocytes undergoing endoreduplication 
and was upregulated following exposure to the proliferation, maturation and ploidy- 
30 promoting factor, Mpl ligand (Zimmet et al., 1997). In plants, the presence of mutiple 
Cyclin D3 genes raises the question. of functional redundancy of these genes and the 
extent to which they may have distinct or overlapping roles (Meijer and Murray, 2000). 
Arabidopsis CycD3;l, which is highly cytokinin-inducible (Riou-Khamlichi et al. 
1999), when ectopically expressed, induced not only DNA replication but also cell 
35 division in trichomes (Schnittger et al., 2002). In synchronized tobacco BY-2 cell 
suspension cultures, tobacco CycD3; 2 was induced in Gl and remained at a constant 
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level through out the cell cycle, similar to mammalian D-type cyclins. In contrast, 
CycD3;l transcripts accumulated during mitosis, a pattern of expression not normally 
associated with D-type cyclins, suggesting a novel role for plant cyclins during mitosis 
or alternatively a BY-2 cell-specific phenomenon and not a normal feature of plant cell- 
5 cycle progression (Sorrell et al., 1999). The GhCycD3.1 identified in our experiments 
shows highest sequence homology to Euphobia esula cyclin D3;2. Since the decreased 
expression of this gene in the outer integuments of lintless mutant 4 A- 183 did not 
affect the epidermal cell division rates, it is appears that this gene is involved in the 
DNA endoreduplication of fibre initials similar to the tomato CycD3;l's involvement 
10 in endoreduplication of the differentiated giant cells of the fruit gel tissue (Joubes et al., * 
2000). 

It will be appreciated by persons skilled in the art that numerous variations 
and/or modifications may be made to the invention as shown in the specific 
embodiments without departing from the spirit or scope of the invention as broadly 
15 described. The present embodiments are, therefore, to be considered in all respects as 
illustrative and not restrictive. 

All publications discussed above are incorporated herein in their entirety. 
Any discussion of documents, acts, materials, devices, articles or the like which 
has been included in the present specification is solely for the purpose of providing a 
20 context for the present invention. It is not to be taken as an admission that any or all of 
these matters form part of the prior art base or were common general knowledge in the 
field relevant to the present invention as it existed before the priority date of each claim 
of this application. 



Dated this thirty-first day of March 2004 

Commonwealth Scientific and Industrial 
Research Organisation 

Patent Attorneys for the Applicant: 

F B RICE & CO 

25 



Cody provided by USPTO from the IFW Image Database on 04/19/2005 



59 

REFERENCES 

Ali, S., Holloway, B. and Taylor, W.C. (2000) Plant. Mol. Biol. Rep. 18:123-132. 
5 Beasley, C.A. and Ting, LP. (1973) Am. J. Bot. 60:130-139. 

Berlin, J,D. (1986) Cotton Physiology, Memphis, Tennessee, pp 375-414. 
Bonaldo M.F., Lenno, G. and Soares, M.B. (1996) Genome Res. 6:791-806. 

10 

Bourque, J. E. (1995) Plant Sci. 105:125-149. 

Craig, S. and Beaton, CD. (1996) J. Microsc. 182:102-105. 

1 5 Cristina, M.D., Sessa, G., Dolan, L., Linstead, P., Baima, S., Ruberti, I. and Morelli, G. 
(1996) Plant J. 10:393-402. 

Haseloff, J. and Gerlach, W.L. (1988) Nature 334:585-591. 
20 Hasenfratz, M.P., Tsou, C.L. and Wilkins, T.A. (1995) Plant /VijW.108:1395-1404. 
Hulskamp, M., Misera, S. and Jurgens, G. (1994) Cell 76:555-566. 

Ji, S.J., Lu, Y.C., Feng J.X., Wei, G., Li, J., Shi, Y.H., Fu, Q., Liu, D., Luo, J.C. and 
25 Zhu, Y.X. (2003) Nucleic Acids Res. 3 1 :2534-2543 . 

Joubes, J. and Chevalier, C. (2000) Plant Mol. Biol. 43:735-745. 

Joubes, J., Walsh, D., Raymond, P. and Chevalier, C. (2000) Planta 21 1:430-439. 

30 

Lemieux, B. (2000) Current Genomics. 1 : 301 -3 1 1 . 

Li, C.H., Zhu, Y.Q., Meng, Y.L., Wang, J.W., Xu, K.X., Zhang, T.Z. and Chen X.Y. 
(2002) Plant Sci. 1 63 : 1 1 1 3-1 1 20. 

35 

Li, X.B., Cai, L., Cheng, N.H. and Liu, J.W. (2002) Plant Physiol. 130:666-674. 



rvwt n * n ,M<=,rt Kv I ISPTO from the 1FW Image Database on 04/19/2005 



60 

Loguercio, L.L., Zhang J.Q. and Wilkins, T.A. (1999) Mol. Gen. Genet. 261:660-670. 
Lu, P., Porat, R., Nadeau, J.A. and O'Neill, S.D. (1996) Plant Cell 8:2155-2168. 

5 

Masucci, J. D., Rene, W.G., Foreman, D.R., Zhang, M. and Galway, M.E. (1996) 
Development 122:1253-1260. 

McFadden, H.G., Chappie, R, De Feyter, R. and Dennis, E. (2001) Mol. Plant Pathol. 
10 58:119-131. 

Meijer, M. and Murray, J.A.H. (2000) Plant Mol. Biol. 43:621-633. 
Needleman, S.B. and Wunsch, CD. (1 970) J. Mol. Biol. 48:443-453 . 

15 

Noda, K.I., Glover, B.J., Linstead, P. and Martin, C. (1994) Nature 369:661-664. 

Ohashi, Y., Oka, A., Ruberti, I., Morelli, G. and Aoyama, T. (2002) Plant J. 29:359- 
369. 

20 

Oppenheimer, D.G., Herman, P.L., Sivakumaran, S., Esch, J. and Marks, M.D. (1991) 
Cell 67:483-493. 

Payne, T., Clement, J., Arnold, D. and Lloyd, A. (1999) Development 126:561-682. 

25 

Perriman, R, Delves, A. and Gerlach, W.L. (1992) Gene 1 13:157-163. 

Rerie, W.G., Feldmann, K.A. and Marks, M.D. (1994) Genes & Development 8:1388- 
1399. 

30 

Riou-Khamlichi, C, Huntley, R, Jacqmard, A. and Murray, J.A.H. (1999) Science 
283:1541-1544. 

Schellmann, S., Schnittger, A., Kirik, V., Wada, T., Okada, K., Beermann, A., 
35 Thumfahrt, J., Jurgens, G. and Hiilskamp, M. (2002) EMBO J. 21 :5036-5046. 



Codv orovided by USPTO from the IFW Image Database on 04/19/2005 



61 

Schenk, P.M., Kazan, K„ Wilson, I., Anderson, J.P., Richmond, T., Somerville, S.C. 
and Manners, J.M. (2000) Proc. Natl. Acad. Sci. USA 97:11655-11660. 

Schnittger, A., SchSbinger, U., Bouyer, D., Weinl, C, Stierhof, Y-D. and Hulskamp, 
5 M. (2002) Proc. Natl. Acad. Sci. USA 99:6410-6415. } 
Senior, I.J. (1998) Biotech. Genet. Engin. Revs. 15:79-119, 

Shippy, R., Lockner, R., Farnsworth, M. and Hampel, A. (1999) Mol. Biotech. 12:117- 
129. 

10 

Smith, N.A., Singh, S.P., Wang, M.B., Stoutjesdijk, P.A., Green, A.G. and Waterhouse, 
P.M. (2000) 407:319-320. 

Solano, R., Nieto, C., Avila, J., Canas, L., Diaz, I. and Paz-Ares, J. (1995) EMBO J. 
15 14:1773-1784. 

Sorrell, D.A., Combettes, B., Chaubet-Gigot, N., Gigot, C. and Murray, J.A.M. (1999) 
Plant Physiol. 1 19:343-35 1 . 

20 Stracke, R., Werber, M. and Weisshaar, B. (2001) Curr. Opin. Plant Biol. 4:447-456. 
Szymanski, D.B. and Marks, M.D. (1998) Plant Cell 10:2047-2062. 

Szymanski, D.B., Lloyd, A.M. and Marks, M.D. (2000) Trends in Plant Science 5:214- 
25 219. 

Turley, R.B. and Ferguson, D.L. (1996) J. Plant Physiol. 149:695-702. 
Van't Hof, J. (1998) Am. J. Bot. 86:776-779. 

30 

Walker, A.R., Davison, P.A., Bolognesi-Winfield, A.C., James, C.M., Srinivasan, N., 
Blundell, T.L., Esch, J.J., Marks, M.D. and Gray, J.C. (1999) Plant Cell 1 1:1337-1350. 

Waterhouse, P.M., Graham, M.W., and Wang, M.-B. (1998) Proc. Natl. Acad. Sci. 
35 95:13959-13964. 



Copy provided by USPTO from the IFW Image Database on 04/19/2005 



62 

Wilson, D.L., Buckley, M.J., Helliwell, C.A. and Wilson I. W. (2003) Bioinformatics 
19:1325-1332. 

Wu, Y., Llewellyn, D.J. and Dennis, E.S. (2002) Plant Mol. Biol. Rep. 20:213-218. 

5 

Yu, X.H., Zhu, Y. Q., Lu, S., Zhang T.Z., Chen X.Y. and Xu, Z.H. (2000) Science in 
China (Series C) 43:623-630. 

Zimmet, J.M., Ladd, D., Jackson, C.W., Stenberg, P.E. and Ravid, K. (1997) Mol. Cell. 
10 Biol. 17:7248-7259. 



Copy provided bv USPTO from the IFW Image Database on 04/19/2005 



63 



CLAIMS 

1 . A method of altering fibre initiation and/or elongation in a fibre producing plant • 
comprising manipulating said plant such that the production of a polypeptide is 

5 modified when compared to a wild-type plant, wherein the polypeptide comprises a 
sequence selected from the group consisting of: 

i) an amino acid sequence provided as any one of SEQ ID NO's: 1 to 16; or 

ii) an amino acid sequence which is at least 50% identical to any one of SEQ ID 
NO's:l to 16. 

0 

2. The method of claim 1, wherein the polypeptide comprises a sequence selected 
from the group consisting of: 

i) an amino acid sequence provided as SEQ ID NO: 1; or 

ii) an amino acid sequence which is at least 80% identical to SEQ ID NO: 1. 

15 

3. The method of claim 1, wherein the polypeptide comprises a sequence selected 
from the group consisting of: 

i) an amino acid sequence provided as SEQ ID NO:2; or 

ii) an amino acid sequence which is at least 80% identical to SEQ ID NO:2. 

20 

4. The method of claim 1, wherein the polypeptide comprises a sequence selected 
from the group consisting of: 

i) an amino acid sequence provided as SEQ ID NO: 3; or 

ii) an amino acid sequence which is at least 80% identical to SEQ ID NO: 3. 

25 

5. The method according to any one of claims 1 to 4 which comprises 
recombinantly expressing the polypeptide in said plant. 

6. The method according to any one of claims 1 to 4 which comprises reducing the 
30 level of the polypeptide endogenously produced by the plant. 

7. The method of claim 6, wherein the level of the polypeptide endogenously 
produced by the plant is reduced by exposing the plant to an antisense polynucleotide 
or a catalytic polynucleotide which hybridizes to an mRNA molecule encoding the 

35 polypeptide. 
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8. The method of claim 6, wherein the level of the polypeptide endogenously 
produced by the plant is reduced by exposing the plant to a dsRNA molecule that 
specifically down-regulates mRNA levels in a cell of an mRNA molecule encoding the 
polypeptide. 

9. The method according to any one of claims 1 to 8, wherein the plant is a 
horticultural plant. 

10. The method according to any one of claims 1 to 8, wherein the plant is a species 
of the Genus Gossypium. 

11. A method of assessing the potential of a fibre producing plant to produce fibre, 
the method comprising analysing the plant for a genetic variation in a polynucleotide 
associated with fibre initiation and/or elongation, wherein the polynucleotide comprises 
a sequence selected from the group consisting of: 

i) a nucleotide sequence provided as any one of SEQ ID NO's: 17 to 45; or 

ii) a nucleotide sequence which is at least 50% identical to any one of SEQ ID 
NO's:17to45. 

12. The method of claim 1 1 which comprises performing an amplification reaction 
on nucleic acids obtained from said plant, or nucleic acids synthesized using nucleic 
acids from said plant as a template, wherein the production of an amplicon in said 
amplification reaction indicates an association with fibre producing potential. 

13. The method of claim 1 1 which comprises performing an amplification reaction 
on nucleic acids obtained from said plant, or nucleic acids synthesized using nucleic 
acids from said plant as a template, wherein the lack of production of an amplicon in 
said amplification reaction indicates an association with fibre producing potential. 

14. The method of claim 1 1 which comprises performing a hybridization reaction on 
nucleic acids obtained from said plant, or nucleic acids synthesized using nucleic acids 
from said plant as a template, wherein a detectable signal produced by the hybridization 
reaction indicates reduced.fibre producing potential. 

15. The method of claim 1 1 which comprises performing a hybridization reaction on 
nucleic acids obtained from said plant, or nucleic acids synthesized using nucleic acids 
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from said plant as a template, wherein the lack of a detectable signal by the 
hybridization reaction indicates reduced fibre producing potential. 

16. The method of claim 11, wherein the polynucleotide is mRNA and the method 
5 comprises determining the levels of mRNA of the polynucleotide in the plant ovule at, 

or around, anthesis. 

17. A method of assessing the potential of a fibre producing plant to produce fibre, 
the method comprising analysing the plant for polypeptide involved in fibre initiation 

10 and/or elongation, wherein the polypeptide comprises a sequence selected from the 
group consisting of: 

i) an amino acid sequence provided as any one of SEQ ID NO's: 1 to 16; or 

ii) an amino acid sequence which is at least 50% identical to any one of SEQ ID 
NO's:l to 16. 

15 

18. The method of claim 17, wherein the method comprises determining the levels 
of the polypeptide in the plant ovule at, or around, anthesis. 

1 9. A substantially purified polypeptide selected from the group consisting of: 

20 i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:l, 

ii) a polypeptide comprising an amino acid sequence which is at least 87% 
identical to SEQ ID NO:l, and 

iii) a biologically active fragment of i) or ii), 

25 wherein the polypeptide regulates fibre initiation and/or elongation. 

20. The polypeptide of claim 19, wherein the polypeptide comprises an amino acid 
sequence which is at least 95% identical to SEQ ID NO: 1 . 

30 21 . A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:2, and 

ii) a biologically active fragment of i), 

wherein the polypeptide regulates fibre initiation and/or elongation. 
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22. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:3, 

ii) a polypeptide comprising an amino acid sequence which is at least 54% 
5 identical to SEQ ID NO:3, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

23. A substantially purified polypeptide selected from the group consisting of: 

10 i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:4, 

ii) a polypeptide comprising an amino acid sequence which is at least 55% 
identical to SEQ ID NO:4, and 

iii) a biologically active fragment of i) or ii), 

1 5 wherein the polypeptide regulates fibre initiation and/or elongation. 

24. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:5, 

20 ii) a polypeptide comprising an amino acid sequence which is at least 50% 

identical to SEQ ID NO:5, and 

iii) a biologically active fragment of i) or ii), 
wherein the polypeptide regulates fibre initiation and/or elongation. 

25 25. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:6, 

ii) a polypeptide comprising an amino acid sequence which is at least 50% 
identical to SEQ ID NO:6, and 

30 iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

26. A substantially purified polypeptide selected from the group consisting of: 

ij a polypeptide comprising an amino acid sequence as provided in SEQ ID 

35 NO:7, 



How nwfifipH hv MSPTO from the IFW Image Database on 04/19/2005 



67 

ii) a polypeptide comprising an amino acid sequence which is at least 79% 
identical to SEQ ID NO: 7, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

5 

27. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:8, 

ii) a polypeptide comprising an amino acid sequence which is at least 66% 
1 0 identical to SEQ ID NO:8, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

28. A substantially purified polypeptide selected from the group consisting of: 

15 i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:9, 

ii) a polypeptide comprising an amino acid sequence which is at least 95% 
identical to SEQ ID NO:9, and 

iii) a biologically active fragment of i) or ii), 

20 wherein the polypeptide regulates fibre initiation and/or elongation. 

29. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO: 10, 

25 ii) a polypeptide comprising an amino acid sequence which is at least 67% 

identical to SEQ ID NO: 10, and 

iii) a biologically active fragment of i) or ii), 
wherein the polypeptide regulates fibre initiation and/or elongation. 

30 30. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO: 11, 

ii) a polypeptide comprising an amino acid sequence which is at least 55% 
identical to SEQ ID NO: 1 1 , and 

35 iii) a biologically active fragment of i) or ii), 

. wherein the polypeptide regulates fibre initiation and/or elongation. 
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31. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO: 12, 

ii) a polypeptide comprising an amino acid sequence which is at least 59% 
5 identical to SEQ ID NO: 1 2, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

32. A substantially purified polypeptide selected from the group consisting of: 

10 i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:13, 

ii) a polypeptide comprising an amino acid sequence which is at least 77% 
identical to SEQ ID NO: 13, and 

iii) a biologically active fragment of i) or ii), 

1 5 wherein the polypeptide regulates fibre initiation and/or elongation. 

33. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO: 14, 

20 ii) a polypeptide comprising an amino acid sequence which is at least 50% 

identical to SEQ ID NO: 14, and 

iii) a biologically active fragment of i) or ii), 
wherein the polypeptide regulates fibre initiation and/or elongation. 

25 34. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid* sequence as provided in SEQ ID 
NO:15, 

ii) a polypeptide comprising an amino acid sequence which is at least 64% 
identical to SEQ ID NO: 15, and I 

30 iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

35. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
35 NO: 16, 
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ii) a polypeptide comprising an amino acid sequence which is at least 50% 
identical to SEQ ID NO: 16, and 

iii) a biologically active fragment of i) or ii), 
wherein the polypeptide regulates fibre initiation and/or elongation. 

36. The polypeptide according to any one of claims 19 to 35, wherein the 
polypeptide can be purified from a species of the Genus Gossypium. 

37. The polypeptide according to any one of claims 19 to 36, which is a fusion 
1 0 protein further comprising at least one other polypeptide sequence. 

38. An isolated polynucleotide comprising a sequence of nucleotides selected from 

the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO: 17; 
15 ii) a sequence of nucleotides as provided in SEQ ID NO: 1 8; 

iii) a sequence encoding a polypeptide according to claim 19 or claim 20; 

iv) a sequence of nucleotides which is at least 87% identical to SEQ ID NO:17 

or SEQ ID NO: 18; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 

20 conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:46. 

39. An isolated polynucleotide comprising a sequence of nucleotides selected from 
25 the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO: 19, 

ii) a sequence of nucleotides as provided in SEQ ID NO:20, 

iii) a sequence encoding a polypeptide according to claim 21, and 

iv) a sequence complementary to any one of i) to iii). 

30 

40. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:21 ; 

ii) a sequence of nucleotides as provided in SEQ ID NO:22; 
35 iii) a sequence encoding a polypeptide according to claim 22; 
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iv) a sequence of nucleotides which is at least 54% identical to SEQ ID NO:21 

or SEQ ID NO:22; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

5 wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:47. 

41. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 
10 i) a sequence of nucleotides as provided in SEQ ID NO:23; 

ii) a sequence encoding a polypeptide according to claim 23; 

iii) a sequence of nucleotides which is at least 55% identical to SEQ ID NO:23; 

and 

iv) a sequence which hybridizes .to any one of i) to iii) under high stringency 
1 5 conditions., 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:48. 

42. An isolated polynucleotide comprising a sequence of nucleotides selected from 

20 the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:24; 

ii) a sequence of nucleotides as provided in SEQ ID NO:25; 

iii) a sequence encoding a polypeptide according to claim 24; 

iv) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:24 

25 or SEQ ID NO:25; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:49. 

30 

43. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:26; 

ii) a sequence of nucleotides as provided in SEQ ID NO:27; 
35 iii) a sequence encoding a polypeptide according to claim 25; 
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iv) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:26 
or SEQ ID NO:27; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

5 wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:50. 

44. An isolated polynucleotide comprising a sequence of nucleotides selected from 

the group consisting of: 
10 i) a sequence of nucleotides as provided in SEQ ID NO :28, 

ii) a sequence of nucleotides as provided in SEQ ID NO:29, 

iii) a sequence encoding a polypeptide according to claim 26, 

iv) a sequence complementary to any one of i) to iii). 

15 45. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:30; 

ii) a sequence of nucleotides as provided in SEQ ID NO:31; 

iii) a sequence encoding a polypeptide according to claim 27; 

20 iv) a sequence of nucleotides which is at least 65% identical to SEQ ID NO:30 

or SEQ ID NO:3 1 ; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
25 provided in SEQ ID NO:5 1 . 

46. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:32, 
30 ii) a sequence of nucleotides as provided in SEQ ID NO:33, 

iii) a sequence encoding a polypeptide according to claim 28, 

iv) a sequence complementary to any one of i) to iii). 

47. An isolated polynucleotide comprising a sequence of nucleotides selected from 

35 the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:34; 
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ii) a sequence of nucleotides as provided in SEQ ID NO:35; 

iii) a sequence encoding a polypeptide according to claim 29; 

iv) a sequence of nucleotides which is at least 70% identical to SEQ ID NO:34 

or SEQ ID NO:35; and 
5 v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:52. 

10 48. An isolated polynucleotide comprising a sequence of nucleotides selected from 

the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:36; 

ii) a sequence of nucleotides as provided in SEQ ID NO:37; 

iii) a sequence encoding a polypeptide according to claim 30; 

15 iv) a sequence of nucleotides which is at least 55% identical to SEQ ID NO:36 

or SEQ ID NO:37; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
20 provided in SEQ ID NO:53. 

49. An isolated polynucleotide comprising a sequence of nucleotides selected from 

the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:38; 
25 ii) a sequence encoding a polypeptide according to claim 31; 

iii) a sequence of nucleotides which is at least 65% identical to SEQ ID NO:38; 

and 

iv) a sequence which hybridizes to any one of i) to iii) under high stringency 
conditions. 

30 

50. An isolated polynucleotide comprising a sequence of nucleotides selected from 

the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:39; 

ii) a sequence of nucleotides as provided in SEQ ID NO.40; 
35 iii) a sequence encoding a polypeptide according to claim 32; 
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iv) a sequence of nucleotides which is at least 95% identical to SEQ ID NO:39 

or SEQ ID NO:40; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:54. 

51. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:41; 

ii) a sequence of nucleotides as provided in SEQ ID NO:42; 

iii) a sequence encoding a polypeptide according to claim 33; 

iv) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:41 

or SEQ ID NO:42; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:55. 

52. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:43; 

ii) a sequence of nucleotides as provided in SEQ ID NO:44; 

iii) a sequence encoding a polypeptide according to claim 34; 

iv) a sequence of nucleotides which is at least 65% identical to SEQ ID NO:43 

or SEQ ID NO:44; and 

v) a sequence which hybridizes to any one of i) to. iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:56. 

53. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:45; 
35 ii) a sequence encoding a polypeptide according to claim 35; 



10 



15 



20 



25 



30 
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iii) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:45; 

and - 

iv) a sequence which hybridizes to any one of i) to iii) under high stringency 
conditions, 

5 wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:57. 

54. A catalytic polynucleotide capable of cleaving a polynucleotide according to 
any one of claims 38 to 53. 

10 

55. The catalytic polynucleotide of claim 54 which is a ribozyme. 

56. An oligonucleotide which comprises at least 19 contiguous nucleotides of a 
polynucleotide according to any one of claims 38 to 53. 

15 

57. A double stranded RNA (dsRNA) molecule comprising an oligonucleotide 
according to claim 56, wherein the portion of the molecule that is double stranded is at 
least 19 basepairs in length and comprises said oligonucleotide. 

20 58. The dsRNA molecule of claim 57 which is expressed from a single promoter, 
wherein the strands of the double stranded portion are linked by a single stranded 
portion. 

59. A vector comprising or encoding the polynucleotide according to any one of ^ 
25 claims 38 to 53. 

60. The vector of claim 59 wherein the polynucleotide is operably linked to an ovule 
or fibre specific promoter. 

30 61. A vector comprising or encoding oligonucleotide of claim 56 or the dsRNA 
molecule of claim 57. 

62. A host cell comprising the vector according to any one of claims 59 to 61 . 

35 63. A transgenic plant, the plant having been transformed with polynucleotide 
according to any one of claims 38 to 53 or the oligonucleotide of claim 56. 
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64. The transgenic plant of claim 63 wherein the polynucleotide is capable of 
expression to produce a polypeptide according to any one of claims 1 9 to 37. 

65. A transgenic plant, the plant having been transformed such that it produces a 
5 catalytic polynucleotide of claim 54 or claim 55, or a dsRNA molecule of claim 57 or 

claim 58. 

66. The transgenic plant of claim 63 or claim 65, wherein the polynucleotide, 
catalytic polynucleotide or dsRNA down-regulates the production of a polypeptide 

10 according to any one of claims 19 to 37 which is endogenously produced by the plant. 

67. A substantially purified antibody, or fragment thereof, that specifically binds a 
polypeptide according to any one of claims 19 to 37. 

15 68. A method of breeding a fibre producing plant, the method comprising 
performing a method according to any one of claims 1 1 to 18. 

69. A method of selecting from a breeding population a fibre producing plant with 
altered fibre initiation and/or elongation potential, the method comprising; 

20 i) crossing two plants which have differing potential to produce fibre, 

ii) performing a method according to any one of claims 11 to 18 on progeny 

plants, 

iii) selecting a progeny plant with altered fibre initiation and/or elongation 
potential when compared to a parent plant. 

25 

70. A plant produced by the method of claim 68 or claim 69. 

7 1 . Seed of a plant of any one of claims 63 to 66 or claim 70. 
30 72. 'Fibre of a plant of any one of claims 63 to 66 or claim 70. 

73. A method of identifying an agent which alters fibre initiation and/or elongation 
of a fibre producing plant, the method comprising 

a) exposing a polypeptide which is at least 50% identical to any one of SEQ ID 
35 NO's:l to 16 to a candidate agent, and 
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b) assessing the ability of the candidate agent to modulate the activity of the 
polypeptide. 

74. A method of identifying an agent which alters fibre initiation and/or elongation 
5 of a fibre producing plant, the method comprising 

a) exposing a polypeptide which is at least 50% identical to any one of SEQ ID 
NO's:l to 16 to a binding partner which binds the polypeptide, and a candidate agent, 
and 

b) assessing the ability of the candidate agent to compete with the binding 
1 0 partner for binding to the polypeptide. 

.75. The method of claim 74, wherein the binding partner is detectably labeled. 

76. A method of identifying an agent which alters fibre initiation and/or elongation 
15 of a fibre producing plant, the method comprising 

a) exposing a polynucleotide encoding a polypeptide which is at least 50% 
identical to any one of SEQ ID NOV.l to 16 to a candidate agent under conditions 
which allow expression of the polynucleotide, and 

b) assessing the ability of the candidate agent to modulate levels of polypeptide 
20 produced by the polynucleotide. 

77. A method of identifying an agent which alters fibre initiation and/or elongation 
of a fibre producing plant, the method comprising 

a) exposing a polynucleotide which is at least 50% identical to any one of SEQ 
25 ID NO's:17 to 45 to a candidate agent, and 

b) assessing the ability of the candidate agent to hybridize and/or cleave the 

polynucleotide. 
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SEQUENCE LISTING 



<110> 


Commonwealth Scientific 


<120> 


Genes involved in plant 


<130> 


502135 


<160> 


72 


<170> 


Patentln version 3.1 


<210> 


1 


<211> 


624 


<212> 


PRT 


<213> 


Gossypium hirsutum 


<400> 


1 



Val Lys Phe Trp Phe Gin Asn Lys Arg Thr Gin Met Lys Ala Gin His 
1 5 10 15 

Glu Arg His Glu Asn Ala He Leu Lys Ala Glu Asn Glu Lys Leu Arg 
20 25 30 

Ala Glu Asn Asn Arg Tyr Lys Glu Ala Leu Ser Asn Ala Thr Cys Pro 
35 "40 45 

Ser Cys Gly Gly Pro Ala Ala Leu Gly Glu Met Ser Phe Asp Glu Gin 
50 ' 55 60 

His Leu Arg He Glu Asn Ala Arg Leu Arg Glu Glu He Asp Arg He 
65 70 75 80 

Ser Gly He Ala Ala Lys Tyr Val Gly Lys Pro Leu Ser Ser Leu Pro 
85 90. 95 

His Leu Ser Ser His Leu His Ser Arg Ser Ala Asp Leu Gly Ala Ser 
100 ' 105 HO 

Asn Phe Gly Asn Gin Ser Gly Phe Val Gly Glu Met Asp Arg Ser Gly 
115 12°. ■ 125 

Asp Leu Leu Arg Ser Val Ser Gly Pro Thr Glu Ala Asp Lys Pro Met 
130 135 140 . 

He Val Glu Leu Ala Val Ala Ala Met Glu Glu Leu He Arg Met Ala 
145 150 155 160 
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Gin Ser Gly Glu Pro Leu Trp Val Pro Gly Asp Asn Ser Thr Asp Val 

165 170 I 75 

Leu Asn Glu Asp Glu Tyr Leu Arg Thr Phe Pro Arg Gly lie Gly Pro 
180 185 190 

Lvs Pro Leu Gly Leu Arg Ser Glu Ala Ser Arg Glu Ser Ala Val Val 
y 195 200 205 

lie Met Asn His Val Asn Leu Val Glu lie Leu Met Asp Val Asn Gin 
210 215 220 

Trp Ser Ser Val Phe Cys Gly He Val Ser Arg Ala Met Thr Leu Glu 
225 230 235 240 

Val Leu Ser Thr Gly Val Ala Gly Asn Tyr Asn Gly Ala Leu Gin Val 
245 250 255 

•Met Thr Ala Glu Phe Gin Val Pro Ser Pro Leu Val Pro Thr Arg Glu 
260 ' . 265 270 

Asn Tyr Phe Ala Arg Tyr Cys Lys Gin His He Asp Gly Thr Trp Ala 
275 280 285 

Val Val Asp Val Ser Leu Asp Asn Leu Arg Pro Asn Pro Met Ser Ser 
290 295 300 

Val Glu Arg Pro Ser Gly Cys Leu He Gin Asn Cys Gin Met Asp Thr 
305 " 310 315 320 

Ser Lys Val He Trp Val Glu His Val Glu Val Asp Asp Arg Ala Val 
325 330 335 

His Asn He Tyr Arg Pro Val Val Asn Ser Gly Leu Ala Phe Gly Ala 
340 345 350 

Lvs Arg Trp Val Ala Thr Leu Asp Arg Gin Cys Glu Arg Leu Ala Ser 
355 • 360 365 

Ser Met Ala Ser Asn He Pro Ala Gly Gly Leu Cys Val He Thr Ser 
370 375 380- 

Pro Glu Gly Arg Lys Ser Met Leu Lys Leu Ala Glu Arg Met Val Thr 
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385 



390 



395 



400 



Ser Phe Cys Thr Gly Val Gly Ala Ser Thr Ala His Ala Trp Thr Thr 
405 410 415 

Leu Ser Ala Thr Gly Ser Asp Asp Val Arg Val Met Thr Arg Lys Ser 
420 425 m 430 

Met Asp Asp Pro Gly Arg Pro Pro Gly He Val Leu Ser Ala Ala Thr 
435 440 445 

Ser Phe Trp He Gin Val Pro Pro Lys Arg Val Phe Asp Phe Leu Arg 
450 455 460 

Asp Glu Asn Ser Arg Ser Glu Trp Asp He Leu Ser Asn Gly Gly Leu 
465 470 475 480 

Val Gin Glu Met Ala His He Ala Asn Gly Arg Asp Pro Gly Asn Cys 
485 490 495 

Val Ser Leu Leu Arg Val Asn Ser Ala Asn Ser Ser Gin Ser Asn Met 
500 . 505 510 

Leu He Leu Gin Glu Ser Cys Thr Asp Ala Lys Gly Ser Tyr Val He 
515 520 525 

Tyr Ala Pro Val Asn He Val Ala Met Asn He Val Leu Ser Gly Gly 
530 535 540 

Asp Pro Asp Tyr Val Ala Leu Leu Pro Ser Gly Phe Ala He Leu Pro 
545 ■ 550 555 560 

AsD civ Pro Gly Val Asn Gly Gly Gly He Leu Glu He Gly Ser Gly 
565 ~ 570' 575 

Gly Ser Leu Leu Thr Val Ala Phe Gin He Leu Val Asp Ser Val Pro 
580 585 590 

Thr Ala Lys Leu Ser Leu Gly Ser Val Ala Thr Val Asn Ser Leu He 
595 600 605 

Lys Cys Thr Val Glu- Arg He Lys Ala Ala Val Lys Cys Asn Asn Ala 
610 615 620 
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<210> 2 

<211> 309 

<212> PRT 

<213> Gossypium hirsutum 



<400> 2 

Met Gly Arg Ser Pro Cys Cys Glu Lys Val Gly Leu Lys Lys Gly Pro 

1 5 10 15 

Trp Thr Pro Glu Glu Asp Gin Lys Leu Leu Ala Tyr" He Glu Gin His 



20 



25 30 



Glv His Gly Ser Trp Arg Ala Leu Pro Ser Lys Ala Gly Leu Gin Arg 
35 40 45 

Cvs Gly Lys Ser Cys Arg Leu Arg Trp He Asn Tyr Leu Arg Pro Asp 
50 55 • 60 



lie Lys Arg Gly Lys Phe 
65 " 70 



Ser Leu Gin Glu Glu Gin Thr He lie Gin 
75 80 



Leu His Ala Leu Leu Gly Asn Arg Trp Ser Ala He Ala Thr His Leu 
85 9° 95 

Pro Lys Arg Thr Asp Asn Glu He Lys Asn Tyr Trp Asn Thr His Leu 
100 • 1° 5 . 110 

Met Lys Arg Leu Thr Lys Met Gly He Asp Pro Val Thr His Lys Pro 
115 120 125 

Lvs Thr Asp Ala Leu Gly Ser Thr Thr Gly Asn Pro Lys Asp Ala Ala 
130 . 135 140 

Asn Leu Ser His Met Ala Gin Trp Glu Ser Ala Arg Leu Glu Ala Glu 
145 150 155 160 

Ala Arg Leu Val Arg Glu Ser Lys Leu Val Pro Ser Asn Pro Pro Gin 
165 . 170 175 

Ser Asn His Phe Thr Ala Val Ala Pro Ser Pro Thr Pro Ala Thr Arg 
180 185 190 
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Pro Gin Cys Leu Asp Val Leu Lys Ala Trp Gin Gly Val Val Cys Gly 
195 200 205 



Leu Phe Thr Phe Asn Met Asp Asn Asn Asn Leu Gin Ser Pro Thr Ser 
210 215 220 * 



Thr Leu Asn Phe Met Glu Asn Thr Thr Thr Leu Pro Met Ser Ser Ser 

225 230 235 240 

Ser Ser Val Asn Gly Met Phe Asn Glu Asn Phe Gly Trp Asn Ser Ser 

245 250 255 



He Asn Pro Cys Glu Ser Gly Asp Asn Leu Lys Val Glu Tyr Gly Ser 
260 265 270 

Asp Gin He Pro Glu Leu Lys Glu Arg Leu Asp His Pro Met Glu Leu 
275 280 285 

His Glu Met Asp Tyr Ser Ser Glu Gly Thr Trp Phe Gin Glu Leu Phe 
290 295 300. 



Gly Phe Asn Gly Leu 
305 



<210> 3 

<211> 150 

<212> PRT 

<213> Gossypium hirsutum 

<400> 3 

Arg Cys Glu Arg Leu Leu Leu Cys Val He Ser Asp Ala Arg Ser He 
! 5-10 15 



His Tyr Leu Pro Ser Val Leu Ala Thr Ala Thr Met Met His Val He 
20 25 30 

Asp Gin Val Glu Leu Phe Asn Pro He Asp Tyr Gin Asn Gin Leu Leu 
35 40 . -45 

Ser Val Leu Lys He Ser Lys Glu Lys Val Asn Asp Cys Tyr Lys Leu 
50 55 60 



He Leu Asp Val Ser Thr Arg Pro Gin Ala Gin Gly Asn Gly Gly Ala 
65 70 75 80 
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Cys Lys Arg Lys Val Glu Glu Arg Val Pro Ser Ser Pro Ser Gly Val 



85 



lie Asp Ala Ala Phe Gly Ser Asp Ser Ser Ser Asp Ser Trp Gly Thr 
100 105 11U 

Val Ser Leu Ser Pro Glu Gin Gin Pro Pro Phe Lys Lys Ser Arg Ala 
115 120 125 

Gin Glu Gin Val Met Arg Leu Pro Ser Leu Asn Arg Val Phe Val Asp 
130. 135 I 40 

He Val Gly Ser Pro Ser 
145 I 50 
<210> 4 
<211> 229 
<212> PRT 

<213> Gossypium hirsutum 
<400> 4 

Met Ala Asn His Thr Val Thr Phe Leu Pro Lys Leu Ser He Glu Ala 
1.5 1° xi> 

He Gin Thr Val Thr Pro Met Arg He Thr Glu Pro Arg- Gin Thr Arg 
20 . 25 30 

Gin Val Leu Ala Gly Glu Leu Val Gly Pro Gly He Phe Gin Arg Cys 
35 " 40 45 

Leu Asn Val Val Gin Tyr Tyr Met Lys Glu Lys Glu Glu Asp Ser Gly 
50 55 60 



Trp Leu Leu Ala Gly Trp He Lys Glu Thr Leu Gly Arg Ala Leu Hi 
65 70 75 80 



s 



Glu Gin Pro Met He Ser Gly Arg Leu Arg Lys Gly Glu Arg Asn Asp 
85 90 95 

Glv Glu Leu Glu He Val Ser Asn Asp Cys Gly He Arg Leu He Glu 
100 105 ■ HO 

Ala Arg He Gin Met Asn Leu Ser Asp Phe Leu Asp Leu Lys Gin Arg 
115 12° I 25 
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Glu Asp Ala Glu Ala Gin Leu Val Phe Trp Lys Asp lie Asp Glu Gin 
130 135 140 

Asn Pro Gin Phe' Ser Pro Leu Phe Tyr Val Gin Val Thr Asn Phe Gin 
145 " 150 155 160 

Cys Gly Gly Tyr Ser lie Gly He Ser Cys Ser He Leu Leu Ala Asp 
y 165. I 70 175 

Leu Leu Leu Met Lys Glu Phe Leu Lys Thr Trp Ala Asp lie Pro Thr 
180 185 19° 

Arg Leu Leu Ser Thr Lys Thr Met Asn Lys Ser Phe Leu Tyr Ser Thr 
195 200 205 

Phe Leu Ala Glu Lys His Gin Trp Cys Leu Pro Thr Ser Ser His Gin 

210 215 220 

lie Gin Ala Lys Leu 
225 

<210> 5 
<211> 107 
<212> PRT 

<213> Gossypium hirsutum 
<400> 5 

Met Ala Lys Tyr Leu. Asn Val Val Leu Val Leu Ala Leu Val Val Val 
15 10 15 

Gin Ala Thr Ala Arg Asn Val Pro Ser Asp Ala 'Ala Gly Leu Asn Asp 
20 25 30 

Gin Lys Asn Leu Leu Thr Tyr Gly Gly He Gly Gly Tyr Ser Gly Met 
35 40 45 

Gly Ser Asn Gly Met Pro Met Gly Gly Val Gly Ser Val Gly Gly Met 
50 55 60 

Thr Gly Leu Gly Gly Thr Gly Gly Met Gly Ala Met Val Gly Val Gly 
65 70 75 80 

Tyr Gly Gly Gly Pro Gly Ala Gly Gly Gly Asn Glu Gly Gly Val Gly 
85 90 
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lie Glv Asn Ala Pro Gly Val Val His Phe Pro 
100 105 • 



<210> 6 
<211> 112 
<212> PRT 

<213> Gossypium hirsutum 
<400> 6 

Ser Ser Asp Ser Arg Lys Pro Leu Ala Ser Phe Tyr Leu Glu Lys Thr 
15 10 15 

Lys Lys Leu Leu Leu Cys Trp Thr Cys Ser Cys Phe Phe Ser Leu Tyr 
20 " 25 30 

Gly Val Val Tyr Gly Leu Tyr Tyr Glu Phe Tyr Met Asn Arg Thr Leu 
35 40 . 45 

Asn Leu Val Arg Lys Leu Arg Met Ser Leu Gly Gly Ala Glu Val Leu 
50 55 60 

Met Ala He Ala Gly Leu- Trp Ala. Val Val Leu Arg Pro Leu Met lie 
65 70 75 80 

Arg Tyr Ala Val -Glu Met Ser Gin Met He Gly He Ser Val Arg Arg 
85 9° 95 

Phe Phe Ser Asn Pro Leu Ser Pro Ser Val Ser Phe Phe Tyr Trp. Tyr. 
100 105 HO 



<210> 7 

<211> 258 

<212> PRT 

<213> Gossypium hirsutum 

<400> 7 

Met Ala Thr Lys Thr Met Met Leu Gin He Phe Pro Leu Phe Phe Phe 
! 5 1° 15 

Leu Phe Ser Val Cys Asn Ser He Phe Leu Gly Ala Asn Gly Asp Asp 
20 25 30 

Asn Gly Gly Trp Gin Thr Ala His Ala Thr Phe Tyr Gly Gly Ala Asp 
35 ^ 40 45 
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Ala Thr Gly Thr Met Gly Gly Ala Cys Gly Tyr Gly Asn Leu Tyr Ser 
50 55 60 . 

Gin Gly Tyr Gly Thr Ser Thr Ala Ala Leu Ser Thr Ala Leu Phe Asn 
65 70 75 80 

Asn Gly Leu Ser Cys Gly Ala Cys Tyr Glu Leu Arg Cys Asn Asn Asp 
85 9° 95 

Pro Gin Trp Cys lie Ser Arg Thr lie Thr Val Thr Ala Thr Asn Phe 
100 105 HO 

Cys Pro Pro Asn Tyr Ala Leu Ser Ser Asp Asn Gly Gly Trp Cys Asn 
115 " 120 125 

Pro Pro Arg Glu His Phe Asp Leu Ala Glu Pro Arg Phe Leu Arg He 
130 • 135 140 

Ala Glu Tyr Arg Ala Gly He Val Pro Val Met Phe Arg Arg Val Ser _ 
145 " * 150 155 160 

Cys Val Lys Lys Gly Gly He Arg Tyr Thr Met Asn Gly His Ser Tyr 

Phe Asn Met Val Leu He Thr Lys Leu Gly Gly Ala Gly Asp He Thr 
180 . 185 190 

Ser Val Ser He Lys Gly Ser Arg Thr Gly Trp Leu Pro Met Ser Arg 
195 200 205 

Asn Trp Gly Gin Asn Trp Gin Ser Asn Ala Tyr Leu Asn Gly " Gin Ser 
210 215 220 

Leu Ser Phe Lys Val Thr Ala Ser Asp Gly Arg Thr He Thr Ala Tyr 
225 230 235 240 

Asn Val Val Pro Ala Gly Trp Gin Phe Gly Gin Thr Phe Glu Gly Gly 
245 250 255 



Gin. Phe 



<210> 8 
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<211> 
<212> 
<213> 



190 
PRT 

Gossypium hirsutum 



<400> 8 

Val Pro Phe Tyr Ser Ser Asn Tyr Leu Leu His Glu Ser Cys Met Met 
1 5 10 15 

Met lie Ala Ser Leu Val Pro Asn Phe Met Met Gly Val lie lie Gly 
20 25 30 

Ala Gly Tyr lie Gly Leu Leu Met Met Thr Ala Gly Tyr Phe Arg Leu 
«35 40 45 

Leu Pro Asp Leu Pro Lys He Phe Trp Arg Tyr Pro Val Ser Tyr He 
50 55 60 

Asn Tyr Gly Ala Trp Ala Leu Gin Gly Ala Tyr Lys Asn Asp Met Val 
65 70 75 

Gly Leu Glu Phe Asp Gly Phe He Pro Gly Gly Pro Lys Leu Lys Gly 
85 9° 9t> 

Asp Val Val Leu Thr Ser Met Leu Gly He His Leu Asp His Ser Lys 
100 105 110 

Trp Trp Asp Leu Ala Ala Val He Met He Leu He Ala Tyr Arg Leu 

115 120 125 

Leu Phe Phe He He Leu Lys Phe Lys Glu Arg Val Ser Pro Leu Phe 

130 135 ■ - I 40 

Arg Thr Leu Tyr Thr Trp Arg Thr Leu Gin His Met Lys Lys Arg Pro 
14 5 150 155 160 

Ser Phe Arg Lys Thr Ser Ala Phe Pro Ser Lys Arg His Gin Val Leu 
165 I 70 175 

His Ser Leu Ser Ser Gin Glu Gly Leu Asn Ser Pro He His 
180 185 I 90 



<210> 9 

<211> 805 

<212> PRT 

<213> Gossypium hirsutum 
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<400> 9 

Met Ala. Asn Pro Val lie Thr Arg Val His Ser Leu Arg Glu Arg Leu 
1 5 10 15 

Asp Glu Thr Leu Leu Ala His Arg Asn Glu lie Leu Ala LeU Leu Ser 
20 25 30 

Arg He Glu Gly Lys Gly Lys Gly lie Leu Gin His His Gin lie He 
35 40 45 

Leu Glu Phe Glu' Ala He Pro Glu Glu Asn Arg Lys Lys Leu Ala Asp 
50 55 60 

Gly Ala Phe Phe Glu Val Leu Lys Ala Ser Gin Glu Ala He Val Leu 
65 .70 75 80 

Pro Pro Trp Val Ala Leu Ala Val Arg Pro Arg Pro Gly Val Trp Glu 
85 9° 95 

Tyr He Arg Val Asn Val His Ala Leu Val Val Glu Glu Leu Thr Val 
^ 100 105 HO 

Ala Glu Tyr Leu His Phe Lys Glu Glu Leu Val Asp Gly Ser Ser Asn 
115 120 125 

Gly Asn Phe Val Leu Glu Leu Asp Phe Glu Pro Phe Asn Ser Ser Phe 
130 135 140 

Pro Arg Pro Thr Leu Ser Lys Ser Val Gly Asn Gly Val Glu Phe Leu 
c 150 155 loO* 

Asn Arg His Leu Ser Ala Lys Leu Phe His Asp Lys Glu Ser Met His 
165 170 1"5 

Pro Leu Leu Glu Phe Leu Arg Val His Cys His Lys Gly Lys Asn Met 
180 185 I 90 

Met Leu Asn Asp Arg He Gin Asn Leu Asn Ala Leu Gin His Val Leu 
195 200 205 

Arg Lys Ala Glu Glu Tyr Leu Gly Thr Leu Pro Pro Glu Thr Pr6 Cys 
210 215 220 

Ala Gly Phe Glu His Arg Phe Gin Glu He Gly Leu Glu Arg Gly Trp 
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225 



230 235 240 



Glv Asp Thr Ala Gin Arg Val Leu Glu Met lie Gin Leu Leu Leu Asp 
y 245 ' 250 255 

Leu Leu Glu Ala Pro Asp Pro Cys Thr Leu Glu Lys Phe Leu Gly Arg 
260 ' 265 270 

He Pro Met Val Phe Asn Val Val He Leu Thr Pro His Gly Tyr Phe 
275 280 285 

Ala Gin Asp Asn Val Leu Gly Tyr Pro Asp Thr Gly Gly Gin Val Val 
290 ' 295 300 

Tvr He Leu Asp Gin Val Arg Ala Leu Glu Asn Glu Met Leu Leu Arg 
305 ^ 310 315 320 

He Lys Gin Gin Gly Leu Asn He Thr Pro Arg He Leu He He Thr 
325 330 335 

Arg Leu Leu Pro Asp Ala Val Gly Thr Thr Cys Gly Gin Arg Leu Glu 
340 345 350 

Lvs Val Tyr Gly Thr Glu Tyr Ser Asp He Leu Arg Val Pro Phe Arg 
355 " 360 365 

Thr Glu Lys Gly He Val Arg Lys Trp He Ser Arg Phe Glu Val Trp 
370 375 380 

Pro Tyr Leu Glu Thr Tyr Thr Glu Asp Val Ala His Glu He Ser Lys 
385 390 395 400 

Glu Leu Gin Gly Lys Pro Asp Leu He He Gly Asn Tyr Ser Asp Gly 
405 410 415 

' Asn He Val Ala Ser Leu Leu Ala- His Lys Leu Gly Val Thr Gin Cys 
420 425 430 

Thr He Ala His Ala Leu Glu Lys Thr Lys Tyr Pro Asp Ser Asp He 
435 440 445 

Tvr Trp Lys Lys Leu Glu Asp Lys Tyr His Phe Ser Cys Gin Phe Thr 
450 455 460 
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Ala Asp Leu Phe Ala Met Asn His Thr Asp Phe He He Thr Ser Thr 

Phe Gin Glu He Ala Gly Ser Lys Asp' Thr Val Gly Gin Tyr Glu Ser 
485 . 490 495 

His Thr Ala Phe Thr Leu Pro Gly Leu Tyr Arg Val Val His Gly He 
500 505 510 

Asp Val Phe Asp Pro Lys Phe Asn He Val Ser Pro Gly Ala Asp Met 
515 520 525 

Glu He' Tyr Phe Pro Tyr Thr Glu Glu Lys Arg Arg Leu Lys His Phe 
530 . 535 540 



His 



Thr Glu He Glu Asp Leu Leu Tyr Ser Lys Val Glu Asn Glu Glu 



545 550 



555 • 560 



His Leu Cys Val Leu Asn Asp Arg Asn Lys Pro He Leu Phe Thr Met 
565 . 570 575 

Ala Arg Leu Asp Arg Val Lys Asn Leu Thr Gly Leu Val Glu Trp Tyr 
580 585 590 

Gly Lys Asn Ala Lys Leu Arg Glu Leu Ala Asn Leu Val Val Val Gly 
595 600 605 

Gly Asp Arg Arg Lys Glu Ser Lys Asp Leu Glu Glu Lys Ala Glu Met 
610 ~ 615 620 

Lys Lys Met Phe Glu Leu He Glu Lys Tyr Asn Leu Asn Gly Gin Phe 
625 630 635 MU 

Arq Trp He Ser Ser Gin Met Asn Arg He Arg Asn Gly Glu Leu Tyr 
y 645 650 655 

Arg Tyr He Cys Asp Thr Lys Gly Ala Phe Val Gin Pro Ala Leu Tyr 
660 665 670 

Glu Ala Phe Gly Leu Thr Val Val Glu Ala Met Thr Cys Gly Leu Pro 
675 680 685 
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Thr Phe Ala Thr Cys Asn Gly Gly Pro Ala Glu He He Val His Gly 
690 695 700 

Lys Ser Gly Phe Asn He Asp Pro Tyr His Gly Asp Gin Ala Ala Asp 
705. " 710 715 720 

He Leu Val Asp Phe Phe Glu Lys Cys Lys Lys Asp Pro Ser His Trp 
725' 730 . , • 735 

Asp Lys He Ser Gin Gly Gly Leu Lys Arg He Glu Glu Lys Tyr Thr 
740 • 745 750 

Trp Lys He Tyr Ser Glu Arg Leu Leu Thr Leu Thr Gly Val Tyr Gly 
755 760 765 

Phe Trp Lys His Val Ser Asn Leu Glu Arg Arg Glu Ser Arg Arg Tyr 
770 775 780 

Leu Glu Met Phe Tyr Ala Leu Lys Tyr Arg Lys Leu Ala Glu Ser Val 
785 790 795 800 

Pro Leu Ala Glu Glu 
805 



<210> 10 

<211> 195 

<212> PRT 

<213> Gossypium hirsutum 

<400> 10 

Met Glu Arg Gly Phe He Val Leu Ala. Leu Thr Val Val Phe Ala Ala 
1 5 10 . 15 

Thr Val Val Thr Ala Ala Asp Glu Ser Gly Leu Ala Asn Glu Cys Ser 
20 25 30 

Lys Asp Phe Gin Ser Val Met Thr Cys Leu Ser Phe Ala Gin Gly Lys 

35 .40 45 

Ala Ala Ser Pro Ser Lys Glu Cys Cys Asn Ser Val Ala Gly He Lys 
50 55 60 

Glu Asn Lys Pro Lys Cys Leu Cys Tyr He Leu Gin Gin Thr Gin Thr 
65 70 . 75 80. 
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Ser Gly Ala Gin Asn Leu Lys Ser Leu Gly Val Gin Glu Asp Lys Leu 
85 90 95 

Phe Gin Leu Pro Ser Ala Cys Gin Leu Lys Asn Ala Ser Val Ser Asp 
100 1° 5 110 

Cys Pro Lys Leu Leu Gly Leu Ser Pro Ser Ser Pro Asp Ala Ala He 
115 I 20 125 

Phe Thr Asn Ser Ser Ser Lys Ala Thr Thr Pro Ser Thr Ser Thr Thr 
130 135 140 

Thr Ala Thr Pro Ser Ser Ala Ala Asp Lys Thr Asp Ser Lys Ser Ser 
145 I 50 

Gly He Lys Leu Gly Pro His Phe Val Gly Ser Thr Ala Ala Leu Leu 
165 I 70 

Val Ala Thr Ala Ala Val Phe Phe Leu Val Phe Pro Ala Gly Phe Ala 
180 185 190 



Ser lie Val 
195 



<210>' 11 

<211> 123 

<212> PRT 

<213> Gossypium hirsutum 

<400> 11 

Met Ala Ser Ser Gly Val Leu Lys Leu Val Ser Met He Leu Met Val 
•! 5 • ■ 10 15 

Cys Met Thr Met Met Ser Ala Pro Lys Ala Ala Lys Ala Ala He Thr 
20 25 30 

Cvs Ser Asp Val Val Asn His Leu He Pro Cys Leu Ser Tyr Val Gin 
35 40 45 

Asn Gly Gly Thr Pro Ala Ala Ala Cys Cys Ser Gly Val Lys Ala Leu 

50 " 55 60 

Tyr Gly Glu Val Gin Thr Ser Pro Asp Arg Gin Asn Val Cys Lys Cys 

65 ' 70 75 . 80 



rt 
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He Lys Ser Ala Val Asn Gly He Pro Tyr Thr Ser Asn Asn Leu Asn 
85 90 95 

Leu Ala Ala Gly Leu' Pro Ala Lys Cys Gly Leu Gin Leu Pro Tyr Ser 
10 0 105 HO 

lie Ser Pro Ser Thr Asp Cys Asn Lys Val Gin 
115 120 

<210> 12 

<211> 282 

<212> PRT 

<213> Gossypium hirsutum _ 

<400> 12 

Pro Arg Val Arg Pro Arg Val Arg Ala His Leu Pro Lys Arg Thr Asp 
1 5 10 I 5 

Asn Glu He Lys Asn Tyr Trp Asn Thr Gin Leu Lys Lys Arg Leu Thr 
20 25 30 

Thr He Gly He Asp Pro Ala Thr His Arg Pro Lys Thr Asp Thr Leu 
35 40 45 

Gly Ser Thr Pro Lys Asp Ala Ala Asn Leu Ser His Met Ala Gin Trp 
50 55 . -60 

Glu Ser Ala Arg Leu Glu Ala Glu Ala Arg Leu Val Arg Glu Ser Lys 
65 70 75 80 

Arq Val Ser Asn Pro Ser Gin Asn Gin Phe Arg Phe Thr Ser Ser Ser 
85 90 95 . 

Ala Pro Pro Leu Val- Ser Lys He Asp Val Gly Leu Ala His Ala Thr 
100 105 HO 

Lvs Pro Gin Cys Leu Asp Val Leu Lys Ala Trp Gin Arg Val Val Thr 
115 120 125 

Glv Leu Phe Thr Phe Asn Thr Asp Asn Leu Gin Ser Pro Thr Ser Thr 
130 135 140 

Ser Ser Phe Thr Glu Asn Thr Leu Pro He Ser Ser Val Gly Phe He 
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145 150 155 160 

Asp Ser Phe Val Gly Asn Ser Asn Asn Ser Cys Cys Gly Asn Asn Trp 
165 170 175 

Glu Cvs Val Glu Lys Ser Ser Gin Val Ala Glu Leu Gin Glu Arg Leu 
180 185 190 

Asp Asn Ser Met Gly Leu His Asp He. Leu Asp Leu Ser Ser Glu Asp 
. 195 200 205 

Val Trp Phe Gin Gly Ser Tyr Arg Ala Glu Asn Met Met Glu Gly Tyr 
210 ~ 215 220 

Ser Asp Thr Leu Met Val Cys Asp Ser Gly Asp His Pro Lys Ser Leu 
225 230 * 235 240 

Ser Met Glu Pro Arg Gin Asn Phe Asn Val Gly Thr Ser Asn Ala Ser 
245 • 250 255 

Ser Phe Glu Glu Asn Lys Asn Tyr Trp Asn Asn He Leu Asn Phe Ala 
260 265 270 

Asn Ala Ser Pro Ser Gly Ser Ser Val Phe 
275 280 



<210> 13 

<211> 177 

<212> PRT 

<213> Gossypium hirsutum 

<400> 13 

Met Lvs Val Leu Ser Pro He Leu Ala Cys Leu Ala Leu Ala Val Val 
1 5 10 I 5 

Ala Ser His Ala Ala Leu Ser Pro Glu Gin Tyr Trp Ser Tyr Lys Leu 
20 25 ■. 30 

Pro Asn Thr Pro Met Pro Lys Ala Val Lys Glu He Leu His Pro Glu 
35 • 40 45 

Leu Met Glu Glu Lys Ser Thr Ser Val Asn Val Gly Gly Gly Gly Val 
50 55 60 



Asn 



Val Asn Thr Gly Lys Gly Lys Pro Ala Gly Gly Thr His Val Asn 
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-7H 75 80 

65 70 

Val Gly Arg Lys Gly Val Gly Val Asn Thr Gly Lys Pro Gly Gly Gly 

Thr His Val Asn 111 Gly Gly Lys Gly Val Gly Val Asn Thr Gly Lys 
100 105 110 

Pro Gly Gly Gly Thr His Val Asn Val Gly Gly Lys- Gly Gly Gly Val 
115 I 20 125 

Ser Val His Thr Gly His Lys Gly Lys Pro Val Asn Val Asn Val Ser 
130 135 140 

Pro Phe Leu Tyr Gin Tyr Ala Ala Ser Glu Thr Gin He His Asp Asp 
14 5 150 i55 

Pro Asn Val Ala Leu Phe Phe Leu Glu Lys Asp Leu His Pro Gly Gin 
165 170 



Gin 



<210>* 14 
<211> 282 
<212> PRT 

<213> Gossypium hirsutum 
<400> 14 

Leu Ser Glu Ser Lys Glu Met Val Phe Gin Phe Asn Phe Pro Val Leu 
1 5 

Leu Leu Cys Leu Met Phe Leu Met Cys Gly Arg Gly Asn Ala Val Arg 
20 25 30 

Asp Leu Glu Gly Lys His Asp Phe Glu Ser His Gly Arg Asp Asp Glu 
35 40 45 

Val Glu Ser Leu Asp Asp Lys Tyr Val Ser Ala Tyr Phe His Gin Thr 
50 55 60 

Phe Asp Ser Ala Asn His Phe Asp Gly Gly Asp Glu Val Lys Asn Leu 
65 ' 70 75 

Glu Asp Lys Tyr Ser Thr Ala Tyr Phe His Lys Ser Leu Asp Ser Gly 
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85 



90 



95 



Asn His Gly Arg Asp Asp Lys Ala Lys He Leu Glu Asp Lys Tyr Ala 
100 1° 5 110 

Thr Ala Tyr Phe His Lys Thr Ser Val Phe Glu Asn His Gly Glu Gly 

115 120 125 

Asp Lys Leu Lys Ser Leu Glu Asp Lys Tyr Ser Ala Ala Tyr Phe His 
130 " 135 140 

Asn Thr Gin Ser Ser Lys Met Met Lys Asp His Asn Met Glu His His 
145 150 . 155 160 

His His Tyr His Asn His Val Glu Ser Ala Glu He Gly Leu Phe Thr 
165 l" 70 175 

He Asp Glu Leu His Thr Phe Asn Val Gly Lys Lys Leu Pro He Phe 
180 185 190 

Phe Pro He Lys Asn His Ser Leu Tyr Pro Pro Leu Leu Pro Lys Gin 
19 5 .200 205 

V 

lie Ala Asp Thr He Pro Phe Ser Ser Phe Gin Val Ser Asn He Leu 
2io ~ 215 220 

Arg Phe Phe Ser Val Ser Pro Asp Ser Pro Lys Gly Lys Ser Cys Ser 
225 230- 235 240 

Ara Tyr Leu Arg Lys Met Arg Thr Arg Ser Ser Ala Arg Gly Arg Pro 
245 250 255 

Lys He Trp Ala Thr Ser Leu Lys Ser Leu His Gly Phe Leu Ser Met 
260 265 270 

His Leu Gly Pro Met Leu lie Ser Ser Ser 
275 280 



<210> 15 
<211> 55 
<212> PRT . 

<213> Gossypium hirsutum 
<400> 15 

Lys Trp Glu Ala Gly Gin Ser Gin Cys Met Val Val Leu Val Phe Thr 
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1 



5 



10 



15 



Gin He Ser Leu Val Lys Gly Lys Arg Lys Leu Cys Tyr Ser Ser He 
20 25 30 

Val Ala Leu He Leu Glu Ser Val Leu Phe Val Leu Thr Phe Pro Ala 
35 40 45 

Leu Thr Asp Met Asn Leu Tyr 

50 55 
<210> 16 
<211> 235 
<212> PRT 

<213> Gossypium hirsutum 
<400> 16 

Met Pro Arg Thr Arg Arg Phe Asn Pro Pro Ser He Thr Ser Arg Thr 
1 5 10 15 

Leu Gly His His Val Tyr Lys Asp Asp Asn Pro He Val Tyr Gly Thr 
20 25 30 

Met Gin Ala Tyr Leu Lys Asp Ala Arg Glu Arg Leu Phe Asn Thr Ala 
35 40 45 

Arcr Thr Ala Glu Lys Leu Gly He His Met Gly Phe Lys Leu Val Arg 
50 55 60 

Gly Ala Tyr Met Ser Ser Glu Thr Lys Leu Ala Ser Ser Leu Gly Phe 
65 70. 75 80 

Asp Ser Pro Val His Asn Thr He Gin Asp Thr His Ala Cys Phe Asn 
85 90 95 

Asp Cys Ala Ser Phe Met He Glu Lys He Ala Asp Gly Tyr Gly Gly 
100 105 _ 110 

Leu Val Leu Ala Thr His Asn Leu Glu Ser Gly Lys Leu Ala Ala Ser 
115 120 125 

Lys Ala Arg Asn Leu Gly He Glu Lys Gly Asn Gin Lys Leu Glu Phe 
130 ' 135 ■ 140 

Ala Gin Leu Tyr Gly Met Ser Glu Ala Leu Ser He Gly Leu Arg Asn 
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145 150 155 160 

Ala Gly Phe Gin Val Ser Lys Tyr Leu Pro Tyr Gly Pro Val Asp Met 
165 170 ±f ° 

Val Met Pro Tyr Leu Leu Arg Arg Ala Glu Glu Asn Arg Gly Leu Leu 
180 1 Q5 190 

Ser Thr Ser Ser Leu Asp Arg Thr Leu Met Gly Lys Glu Leu Lys Arg 
195 200 205 

Arg Leu Lys Ser Leu Gin Phe Ala Lys Pro Glu Met Ala Ala Ser Ala 

210 215 220 

Ala Gly Ser Met Lys lie Glu lie Gly Thr Pro 
225 230 235 

<210> 17 

<211> 2207 

<212> DNA 

<213> Gossypium hirsutum 



<400> 17 
gtcaagttct 


ggttccaaaa 


caagcgcacc 


caaatgaagg 


cccaacatga 


acgccatgaa 


60 


aatgctatac tgaaggctga 


gaatgaaaaa 


ctccgagctg 


agaataatag 


gtacaaggaa 


120 


gctctcagca 


atgctacatg 


ccccagctgt 


ggaggcccag 


ctgcccttgg 


agagatgtca 


180 


tttgatgagc 


aacatttgag 


aatagaaaat 


gctcggttaa 


gggaagagat 


tgataggata 


240 


tctggaatag 


ctgctaaata* tgttggcaag 


cctttatctt 


cattgcctca 


cctttcatct 


300 


catttacatt 


cgcgctctgc 


tgatcttgga 


gctagcaatt 


tcgggaatca 


atcaggattt 


360 


gtaggggaaa 


tggatcgcag 


tggtgatctt 


ctgaggtctg 


tctctggacc 


tacagaagcg 


420 


gataagccca 


tgattgttga 


gcttgctgtt 


gctgcaatgg 


aggaactaat 


acgaatggcc 


480 


caatctgggg 


aacctttgtg 


ggttcctggg 


gacaattcta 


cagatgtgtt 


gaacgaagat 


540 


gaatacttaa gaactttccc 


taggggaatt 


ggaccaaagc 


ctttggggtt 


gaggtctgaa 


600 


gcttcaagag 


aatctgcagt 


tgtcatcatg 


aatcatgtca 


acttagttga 


gattctcatg 


660 


gatgtgaatc 


aatggtcaag 


tgtgttttgc 


ggtatt,gttt 


caagggctat 


gactttagaa 


720 


gtcctatcaa 


ctggagttgc 


aggaaactac 


aatggggcct 


tgcaagtgat 


gacggctgag 


780 


ttccaagtcc 


cttcaccact 


tgtaccaact 


cgggaaaatt 


atttcgcgag 


gtactgtaag 


840 


cagcatattg 


atggaacttg 


ggcagtggtt 


gatgtttcct 


tggataattt 


acgccctaac 


900 
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ccaatgtcaa 

tctaaggtta 

agaccagtag 

cgacagtgtg 

gttataacaa 

agcttttgta 

ggctccgatg 

ggtattgtac 

gatttcctaa 

gttcaagaaa 

cgcgtaaata 
gatgctaaag 



gtgtagagag 

tatgggtcga 

ttaattccgg 

agcgtctagc 

gcccagaagg 

caggtgttgg 

atgtgcgggt 

ttagtgctgc 

gggatgagaa 

tggctcacat 



gccctcaggt 

gcatgtagaa 

tctagctttt 

aagttcaatg 

gaggaaaagt 

tgcttctacg 

tatgacccga 

aacttccttc 

ctctagaagt 

agctaatggt 



tgcttgatcc 

gtggatgata 

ggagcaaaac 

gccagtaaca 

atgttgaagt 

gcccatgctt 

aagagcatgg 

tggatccaag 

gagtgggata 

cgtgatccag 



agaattgcca 

gagctgtcca 

gttgggtggc 

ttccagcagg 

tggcagagag 

ggacaacttt 

atgatccagg 

ttccaccaaa 

tcctatcaaa 

gcaattgtgt 



aatggatacc 

caacatatac 

tacgttggat 

gggtctatgc 

gatggtgact 

atcggcaaca 

aaggcctcct 

gagggtattt 

tggtggccta 

ctctttactc 



gtgcaaactc 
ggtcctacgt 



tagccaaagc 
gatatatgcc 



aacatgttga 
ccggtcaata 



tacttcaaga 
ttgttgcaat 



ttaagtggcg 

gatggtccag 

accgttgctt 

gtggcgactg 

tgcaataatg 

ccgaaaacaa 

gtcatggtgt 

ttgactttgc 

ggctaagagt 

tgttat.tgtt 



gggacccgga 

gagttaatgg 

tccagatttt 

tcaacagtct 

cttgaccaaa 

atttaacgat 

tctgtaagaa 

caacgagatt 

tcaatgtagg 

gaagcattag 



ttatgtcgca 

aggagggatc 

ggttgattca 

aattaaatgc 

catgatataa 

tgaagaagtc 

ggcaaaatca 

ctaatattag 

agtggacttt 

ttaatttgga 



ctattgccat 

ctcgaaatcg 

gttcccacag 

acggttgaaa 

aaaaaggaaa 

aagagcgcac 

tcaagcctgc 

atatgttggg 

atactagtct 

cttattcctc 



ccggtttcgc 
gctcgggtgg 
caaagctttc 
ggatcaaggc 
cgagaagaaa 
ctttcaattc 
aaggatagta 
agaactcccc 
aatttctttc 
cattaac 



gagctgcact 
gaacatcgtc 

aattcttccc 

ctctctcctt 

tcttggatca 

tgccgtaaag 

aggtgtttgt 

atcctttgcg 

ggttcgggaa 

attttgtgta 

tggtttcatg 



960 
1020 
1080 
1140 
1200 
.1260 
1320 
1380 
1440 

1500 

1560 
1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2207 



<210> 18 
<211> 1872 
<212> DNA 

<213> Gossypium hirsutum 

gtcaagttct ggttccaaaa caagcgcacc caaatgaagg cccaacatga acgccatgaa 
aatgctatac tgaaggctga gaatgaaaaa ctccgagctg agaataatag gtacaaggaa 
gctctcagca atgctacatg ccccagctgt ggaggcccag ctgcccttgg agagatgtca 
tttgatgagc aacatttgag aatagaaaat gctcggttaa gggaagagat tgataggata 



60 
120 
180 
240 
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420 
480 
540 
600 
660 
720 



tctggaatag ctgctaaata tgttggcaag cctttatctt cattgcctca cctttcatct 300 
catttacatt cgcgctctgc tgatcttgga gctagcaatt tcgggaatca atcaggattt 360 
gtaggggaaa tggatcgcag tggtgatctt . ctgaggtctg tctctggacc tacagaagcg 
gataagccca tgattgttga gcttgctgtt gctgcaatgg aggaactaat acgaatggcc 
•caatctgggg aacctttgtg ggttcctggg gacaattcta cagatgtgtt gaacgaagat 
gaatacttaa gaactttccc taggggaatt ' ggaccaaagc ctttggggtt gaggtctgaa 
gcttcaagag aatctgcagt tgtcatcatg aatcatgtca acttagttga gattctcatg 
gatgtgaatc aatggtcaag tgtgttttgc ggtattgttt caagggctat gactttagaa 
gtcctatcaa ctggagttgc aggaaactac aatggggcct tgcaagtgat gacggctgag 780 
-ttccaagtcc cttcaccact tgtaccaact cgggaaaatt atttcgcgag gtactgtaag 840 
cagcatattg atggaacttg ggcagtggtt gatgtttcct tggataattt acgccctaac 
ccaatgtcaa gtgtagagag gccctcaggt tgcttgatcc agaattgcca aatggatacc 
tctaaggtta tatgggtcga gcatgtagaa gtggatgata gagctgtcca caacatatac 
agaccagtag ttaattccgg tctagctttt ggagcaaaac gttgggtggc tacgttggat 
cgacagtgtg agcgtctagc aagttcaatg gccagtaaca ttccagcagg gggtctatgc 
gttataacaa gcccagaagg gaggaaaagt atgttgaagt tggcagagag gatggtgact 
agcttttgta caggtgttgg tgcttctacg gcccatgctt ggacaacttt atcggcaaca 
ggptccgatg atgtgcgggt tatgacccga aagagcatgg atgatccagg aaggcctcct 
ggtattgtac ttagtgctgc aacttccttc tggatccaag ttccaccaaa gagggtattt 
gatttcctaa gggatgagaa ctctagaagt gagtgggata tcctatcaaa tggtggccta 14 40 
gt'tcaagaaa tggctcacat agctaatggt cgtgatccag gcaattgtgt ctctttactc 
cgcgtaaata gtgcaaactc tagccaaagc aacatgttga tacttcaaga gagctgcact 
gatgctaaag ggtcctacgt gatatatgcc ccggtcaata ttgttgcaat gaacatcgtc 
ttaagtggcg gggacccgga ttatgtcgca ctattgccat ccggtttcgc aattcttccc 
gatggtccag gagttaatgg aggagggatc ctcgaaatcg gctcgggtgg ctctctcctt 174 0 
accgttgctt tccagatttt ggttgattca gttcccacag caaagctttc tcttggatca 1800 
gtggcgactg tcaacagtct aattaaatgc acggttgaaa ggatcaaggc tgccgtaaag 18 60 
tgcaataatg ct 



900 • 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



1500 
1560 
1620 
1680 



1872 
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<210> 19 

<211> 1180 

<212> DNA 

<213> Gossypium hirsutum 



<400> 19 
tagaaattgt 


tatacagttc 


tagctaaggt 


tcatttgaaa 


gatacataca 


4- a<i -i f— -^i r~~> 3 /— « ;ri 


60 


tatatat'atg 


gggagatcac 


catgttgtga 


aaaggtaggg 


t tgaagaaag 


z^r-}- f~* r~> zi t~ rrn3<^ 
y Luua uyyau 


120 


cccagaagaa 


gatcaaaagc 


tcttagctta 


cattgaacaa 


catggccatg 


t*-w ^ -a 4— f~r i~w r~* f~r 

gadgcuyguy 


180 


tgccttgcct 


tcaaaagctg 


ggcttcaaag 


atgtggaaag 


agttgcagac 


t gaga t ggat 


9 4 n 

£. *i \J 


taactacttg 


agacctgata 


tcaaaagagg 


aaagttcagt 


ttacaagaag 


aacagaccaT: 




tattcaactc 


catgcccttc 


ttggaaacag 


gtggtctgcc 


atagctactc 


atttgccgaa 


o u u 


aagaacagac 


aatgagatca 


agaactactg 


gaacacacat 


ctaatgaaaa 


gyCuaaccaa 


4 20 


aatggggatc 


gatcctgtca 


cccacaagcc 


taaa&ccgat 


gcactcggct 




4 80 


taaccctaaa 


gatgctgcta 


accttagtca 


catggctcaa 


tgggagagtg 


ctcy tl. Lay a 


540 


agctgaagct 


agactggttc 


gtgagtccaa 


gctagttcct 


tcaaaccct c 


ctcaaagcaa 


600 


ccatttcact 


gccgttgcgc 


cttcgccgac 


tccggcaact 


agaccgcaat 


+■ r—* fx "4— 

gCCLCyaCy L_ 


660 


actcaaagca 


tggcaaggtg 


tcgtctgcgg 


gtt attcact 




CL GL CA v— Q *■ 


720 


cttacagtcc 


cctacgtcaa 


cgttgaactt 


catggagaac 


accacaacat 


tgcccatgtc 


—] o <~\ 

/ o U 


atcatcatcg 


tctgttaatg 


gaatgtttaa 


tgaaaacttt 


ggttggaact 


catcgattaa 


840 


tccatgtgaa 


agtggggata 


atttgaaagt 


tgaatatggc 


agtgatcaaa 


ttccagagtt 


900 


aaaggaaaga 


ttggatcatc 


caatggaatt 


gcatgaaatg 


gactattctt 


cagagggtac 


960 


atggtttcaa 


gagttgtttg 


gatttaatgg 


tttatgattc 


tgcagaagga 


ttcatcaaag 


1020 


gaaagaaagc 


tatctggttt 


catctttgaa 


gttcacttaa 


gtgtaggatt 


tttattcaca 


1080 


agtgccttca 


catattacca 


ttaactgtaa 


taataaacct 


tcaaattaat 


aaattaaaaa 


1140 


actcacaagg 


gtttttggcc 


aaaaaaaaaa 


aaaaaaaaaa 






1180 



<210> 20 
<211> 927 
<212> DNA 

<213> Gossypium hirsutum 
<400> 20 

atggggagat caccatgttg tgaaaaggta gggttgaaga aaggtccatg gaccccagaa 
gaagatcaaa agctcttagc ttacattgaa caacatggcc atggaagctg gcgtgccttg 



dnnv wnvifferi bv USPTO from the IFW Image Database on 04/19/2005 



ccttcaaaag ctgggcttca aagatgtgga aagagttgca gactgagatg gattaactac 180 

ttgagacctg atatcaaaag aggaaagttc agtttacaag aagaacagac cattattcaa 240 

ctccatgccc ttcttggaaa caggtggtct gccatagcta ctcatttgcc gaaaagaaca 

gacaatgaga tcaagaacta ctggaacaca catctaatga aaaggctaac caaaatgggg 

atcgatcctg tcacccacaa gcctaaaacc gatgcactcg gctccaccac tggtaaccct 

aaagatgctg ctaaccttag tcacatggct caatgggaga gtgctcgttt agaagctgaa 

gctagactgg ttcgtgagtc caagctagtt ccttcaaacc ctcctcaaag caaccatttc 

actgccgttg cgccttcgcc gactccggca actagaccgc aatgcctcga. cgtactcaaa 

gcatggcaag gtgtcgtctg cgggttattc actttcaaca tggacaataa caacttacag 

tcccctacgt caacgttgaa cttcatggag aacaccacaa cattgcccat gtcatcatca 

tcgtctgtta atggaatgtt taatgaaaac tttggttgga actcatcgat taatccatgt 

gaaagtgggg. ataatttgaa agttgaatat ggcagtgatc aaattccaga gttaaaggaa 

agattggatc atccaatgga attgcatgaa atggactatt cttcagaggg tacatggttt 

caagagttgt ttggatttaa tggttta 

<210> 21 
<211> 600 
<212> DNA 

<213> Gossypium hirsutum 

agcgatgtga gcgtctcctc ctctgtgtaa tctctgatgc aagatccatc cattatcttc 
cctctgtatt ggctactgca accatgatgc acgtcataga ccaagttgag cttttcaatc 
ccattgacta ccaaaatcag ctgctgagtg ttcttaaaat tagcaaggaa aaagtaaacg 
attgttacaa gctcatcctt gatgtatcaa caagacccca ggcccaaggc aatggtggtg 
catgtaagag gaaggtggag gagagggttc ctagcagccc tagtggagtg attgatgctg 
catttggcag tgatagctcg agcgattctt ggggcacggt gtccttatcg cctgagcagc 
agccaccttt taagaagagc agagcccaag agcaagtaat gcgtttgcca tcactcaacc 
gagtctttgt agacattgtt ggcagccctt cttaattata tctcccttct ctctctccct 
cgctctctcc atctctttct ttgtcccaaa aagatctata tttattatgc ttatgttcac 
ttttggttca aggaatcaaa tgttaagtta aaaaaatgaa aaaaacaaag taaaagctgc 



300 
360 
420 
480 
.540 
600 
660 
720 
780 
840 
900 
927 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



<210> 22 
<211> 452 



Cody orovided by USPTO from the IFW Image Database on 04/19/2005 



180 
240 
300 
360 
420 
452 



<212> DNA 

<213> Gossypium hirsutum 

agcgatgtga gcgtctcctc ctctgtgtaa tctctgatgc aagatccatc cattatcttc 60 
cctctgtatt ggctactgca accatgatgc acgtcataga ccaagttgag cttttcaatc 120 
ccattgacta ccaaaatcag ctgctgagtg ttcttaaaat tagcaaggaa aaagtaaacg 
attgttacaa gctcatcctt gatgtatcaa caagacccca ggcccaaggc aatggtggtg 
catgtaagag gaaggtggag gagagggttc ctagcagccc tagtggagtg attgatgctg 
catttggcag tgatagctcg agcgattctt ggggcacggt gtccttatcg cctgagcagc 
agccaccttt taagaagagc agagcccaag agcaagtaat gcgtttgcca tcactcaacc 
gagtctttgt agacattgtt ggcagccctt ct 

<210> 23 
<211> 704 
<212> DNA 

<213> Gossypium hirsutum 

gctgiacacc ccaaagatgg ccaaccacac cgttaccttt ctccctaaac tatccattga 60 
agctattcag acagtgactc cgatgaggat aactgaacca cgacagactc gacaagtatt ■ 120 
ggcaggggag cttgtaggac ccgggatttt ccaaaggtgt ttgaacgtgg tccagtacta 
catgaaggag aaagaagaag actctggttg gttactggct gggtggatca aggaaacact 
tgggagagct ttacatgagc aaccaatgat ttctggtcgt cttcggaaag gggaacgaaa 
cgatggagaa ttggagattg tttccaatga ctgcggcatt agactcattg aggcaaggat 
tcagatgaat ctgtcggatt ttcttgattt gaaacaaagg gaagatgctg aagctcagct 
tgttttctgg aaagatattg atgagcaaaa cccacagttc tccccactct tttatgttca 
gg ttactaat ttccagtgtg gtggatattc aattgggatt agctgcagta ttcttctggc 
agatcttttg ttaatgaaag aattccttaa gacatgggca gatattccaa caaggttatt 
atcaacaaaa acgatgaaca aaagcttcct ttattctacc ttcctggctg aaaaacacca 
atggtgcctc cctacatcat cacatcaaat tcaagcaaaa ctca 



180 
240 
300 
360 
420 
480 
540 
600 
660 
704 



<210> 24 

<211> 548 

<212> DNA 

<213> Gossypium hirsutum 



Codv n^nviHed bv USPTO from the IFW Image Database on 04/19/2005 



<220> 

<221> misc^f eature 

<222> (491) . . (491) 

<223> n = unknown 



240 
300 
360 
420 
480 
540 
548 



ctgagttaag agtttcaatt cttctactta ttatagttaa atatcatata tggcoaagta 
cttgaatgtt gtgcttgttc ttgctctagt agtggttoaa gctactgcaa ggaatgtgcc 
tagcgatgct gctggtctca atgaccaaaa gaacctcctc acataoggtg gcattggcgg 
ctactctggc atgggttcaa atggcatgcc aatgggtgga gttgggagtg ttggtggtat 
gactggcctt ggtggtacag gtgggatggg cgccatggta ggtgttgggt atggaggtgg 
gcctggcgct ggtggtggaa atgaaggtgg tgttggcatt ggcaatgcgc ctggtgtcgt 
ccactttcct tgaactttgc tggatggtta aaattttaaa gcaactagtt tcttgaactt 
tgctggaggg gtttaaattt taaagcaaot agtctaactt acgttaaaga ataatattaa 
tgttgctcta nagtgtgaaa tgttgtoctg tgtatgggtt atgtgataag tccatcttta 
tttttttt • 

<210> 25 
<211> 321 
<212> DNA 

<213> Gossypium hirsutum 

atggccaagt acttgaatgt tgtgcttgtt cttgctctag tagtggttca agctactgca ' 60 

aggaatgtgc ctagcgatgc tgotggtctc aatgaccaaa agaacotcct cacatacggt 120 

ggcattggcg gctactctgg catgggttca aatggcatgc caatgggtgg agttgggagt 

attggtggta tgactggcct tggtggtaca ggtgggatgg gcgccatggt ^ggtgttggg 
Xtggajgtg ggcctggcgc tggtggtgga aatgaaggtg gtgttggcat tggcaatgcg 

cctggtgtcg tccactttcc t 

<210> 26 

<211> 727 . 

<212> DNA 

<213> Gossypium hirsutum 
<220> 

<221> mi sc___f eature 

<222> (26) . . (26) 

<223> n = unknown 



180 

240 
300 

321 



r^, hv IISPTO from the IFW Image Database on 04/19/2005 



ccaaaatgta agtcttcaaa accaanagaa gaaactgtaa agcagtagta atgcaaatgc 
ttagacactc aaatataagt agcaaactaa octatgggtt atttggctga ttttgaaggg 
ttcatggtgt attttggtgc gtgtctgtta agaatccgag ttgttgtccc gtggtattag 
cttctctgtc ttgctggttg cgattgggca gttgtgacgt ctataatcaa gtgattcaag 
gaaaccgtta gcttcatttt acttggagaa gacaaagaag ctattgttgt gctggacttg 
ttcttgcttt ttctctttgt atggtgtggt ttatggtttg tattatgagt tttatatgaa 
tagaactttg aatttggtga gaaaattaag aatgagcttg ggaggagcag aagtgttgat 
ggcaatagca gggttgtggg cagtggtttt gaggccattg atgataaggt atgccgtaga 
gatgagtcaa atgattggaa tttccgttag gagatttttc agtaatcctc tttccocttc 
cgtatcgttt ttttattggt actgatatag aaattctatg aaatgagoac aatatgagac 
accatttttt gctagccaag aagttagatg agtagtagac tttggtttaa gcttatcata 
attgaaattg ttagactgta acccttttgt ctcctttotc taatttcaaa tccaaattcc 
catcaat 



<210> 27 

<211> 562 

<212> DNA 

<213> Gossypium hirsutum 
<220> 

<221> miscjeature 

<222> (26) . . (26) 

<223> n = unknown 



300 

360 

420 

480 

540 

600 

660 

720 

727 



ccaaaat^a agtcttcaaa accaanagaa gaaactgtaa agcagtagta atgcaaatgc 60 
ttagacactc aaatataagt agcaaactaa cctatgggtt atttggctga ttttgaaggg 120 

ttcatggtgt attttggtgc gtgtctgtta agaatccgag ttgttgtccc gtggtattag 
cttctctgtc ttgctggttg cgattgggca gttgtgacgt ctataatcaa gtgattcaag 
gaaaccgtta gcttcatttt' acttggagaa gacaaagaag ctattgttgt gctggacttg 
ttcttgcttt ttctctttgt atggtgtggt ttatggtttg tattatgagt tttatatgaa 
tagaactttg aatttggtga gaaaattaag aatgagcttg ggaggagcag aagtgttgat 
ggcaatagca gggttgtggg cagtggtttt gaggccattg atgataaggt atgccgtaga 



180 
240 
300 
360 
420 
480 



How iwws,w kv USPTO from the 1FW image Database on 04/19/2005 



540 
562 



gatgagtcaa atgattggaa tttccgttag gagatttttc agtaatcctc tttccccttc 
cgtatcgttt ttttattggt ac 

<210> 28 
<211> 835 
<212> DNA 

<213> Gossypiura hirsutum 

Jacggtgg^g ctgatgctac cggcacaatg gggggagctt gtggttatgg aaacctgtac 60 
agtcaagggt atggaacgag cacagcagct ttgagcactg cacttttcaa caatggcttg 120 
agctgcggtg cctgctacga gctccggtgc aacaatgatc ctcaatggtg cattagtcga 
accataaccg tgacagccac caacttttgt ccacctaact atgctttatc tagtgacaat 
ggcgggtggt gcaatccccc acgagaacac tttgatttgg ccgaaccggc attcttgcgg 
atagcagaat atcgagctgg aatcgtccct gttatgttca gaagggtgtc atgtgtgaag 
aaaggaggca tcaggtacao catgaatgga cattcgtact tcaacatggt gttgataacg 
aacgtgggag gggcagggga tataacgtca gtgtccatca agggttccag aacaggatgg 
ctacctatgt ccagaaattg gggccaaaac tggcagagca atgcttacct taacggacaa 
agcctctctt ttaaagtgac tgccagcgat ggcaggacta tcacagccta caatgtagtg 
cctgctggtt ggcaattcgg acaaactttt gaaggaggcc agttttaaga caatattata 
gtgtctgtct aatataaaac tggaattgac atattactta tataaggcac atgagcgttt 
tatgccgagg tagcaaaatg gcgcccgctg gctttatgtg tgaaataggc gagcaagtgc 
cattagccta taatctatac atttcttata gtgaaccaaa ctattaagtt tgaac 

<210> 29 
<211> 765 
<212> DNA 

<213> Gossypium hirsutum 

Jacggtggtg ctgatgctac cggcacaatg gggggagctt gtggttatgg aaacctgtac 60' 

aqtcaagggt atggaacgag cacagcagct ttgagcactg cacttttcaa caatggcttg 120 
agctgcggtg cclgctacga gctccggtgc aacaatgatc ctcaatggtg cattagtcga 180 

accataaccg tgacagccac caacttttgt ccacctaact atgctttatc tagtgacaat 24 0 

ggcgggtggt gcaatccccc acgagaacac tttgatttgg ccgaaccggc attcttgcgg 300 

atagcagaat atcgagctgg aatcgtccct gttatgttca gaagggtgtc atgtgtgaag 360 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

835 



Cody orovided bv USPTO from the IFW Image Database on 04/19/2005 



aaaggaggca tca g g tacac catgaatgga cattcgtact tcaacatggt gttgataacg 
aacgtgggag gggcagggga tataacgtca gtgtccatca agggttccag aaoaggatgg 
ctacctatgt ccagaaattg gggccaaaac tggcagagca atgcttacct taacggacaa 
agcctctctt ttaaagtgac tgccagcgat ggcaggacta tcacagccta caatgtagtg 
cctgctggtt ggcaattcgg ac'aaactttt gaaggaggcc agttttaaga caatattata 
gtgtctgtct aatataaaac tggaattgac atattactta tataaggcac atgagcgttfc 
tatgccgagg tagcaaaatg gcgcccgctg gctttatgtg tgaaa 

<210> 30 
<211> 985 
<212> DNA 

<213> Gossypiura hirsutum 

tg^ccctttc tacagcagca attacctatt acatgagagc tgcatgatga tgatagcatc 
actagttccc aacttcatga tgggagtcat aattggagct ggttatatag gtttgctaat 
gatgacagct gggtatttca gattgctgcc agatctccct aagatattct ggcgttaccc 
tgtttcatat atcaactatg gtgcatgggc attgcaggga gcatacaaga atgatatggt 
tgggcttgag tttgatggct tcatacctgg tggtccaaaa ctgaaaggtg atgtcgtcct 
cacatccatg ctaggcatcc atctggatca ttcaaagtgg tgggacttag cagctgttat 
aatgattttg atagcttata gattactttt cttcatcatt ctcaagttca aggagagagt 
gtcaccattg tttcgaactc tttatacatg gcgaacattg cagcacatga aaaaacgacc 
ttcttttagg aaaacatcag ccttcccatc caagaggcac caagttctac attcactgtc 
ttctcaagag ggtctaaact ctccaattca ctagaagcaa caaatcatga gtactatagt 
aatgctctta ctggaatttg attacagaaa caaagggaaa gagattatag tagaattaca 
tatggaatta cctgtatcag ctttattttt caagtgcttc taatatctgc ggactgttct 
ggcattaatg gcaagagagt ttcccatcac ccaagaatgg tttgtttatg gtcctcccta 
gcaatggcga tgaagagcag aaacctgatt tctgttgttg caaccagtgc tttgaagtaa 
ccagatatga taaacaggta cagaaaatat cccattgttc ttcgtagata atttcatctg 
ccaaatgttt gtagctgatg cctcctacat tatacaatgt cataacatct aatgatacca 
ttatatttgt acgtaaaaaa aaaaa 

<210> 31 



420 
480 
540 
600 
660 
720 
765 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
985 



Cn*w orovided bv USPTO from the IFW Image Database on 04/19/2UU& 



<211> 571 
<212> DNA 

<213> Gossypium hirsutum 
<400> -31* 

tgtccctttc tacagcagca attacctatt acatgagagc tgcatgatga tgatagcatc 60 

actagttccc aacttcatga tgggagtcat aattggagct ggttatatag gtttgctaat 120 
gatgacagct gggtatttca gattgctgcc agatctccct aagatattct ggcgttaccc . 180 

tgtttcatat atcaactatg gtgcatgggc attgcaggga gcatacaaga atgatatggt 240 

tgggcttgag tttgatggc.t tcatacctgg tggtccaaaa ctgaaaggtg atgtcgtcct 300 

cacatccatg ctaggcatcc atctggatca ttcaaagtgg tgggacttag cagctgttat 360 

aatgattttg atagcttata gattactttt cttcatcatt ctcaagttca aggagagagt 420 

gtcaccattg tttcgaactc tttatacatg gcgaacattg cagcacatga aaaaacgacc 480 

ttcttttagg aaaacatcag ccttcccatc caagaggcac caagttctac attcactgtc 540 

ttctcaagag ggtctaaact ctccaattca c 571 

<210> 32" 
<211> 2611 
<212> DNA 

<213> Gossypium hirsutum 
<400> 32 

caaagaaatg gctaatcctg tgatcactcg cgtccacagt ctccgtgagc gtttagatga 60 
gacccttctt gcccacagga acgagatttt ggccttgctc 'tcaaggatcg agggcaaagg 120 
aaaaggaatt ctgcaacacc atcaaattat tctagagttt gaagctatcc ctgaagagaa 180 
cagaaagaag ctcgctgatg gtgcattttt tgaagtattg aaggctagtc aggaagcgat 24 0 
cgtgttgcct ccatgggttg cacttgctgt tcgtccaagg cctggtgttt gggagtacat 300 
tagagtgaat gttcacgccc ttgttgttga ggaacttact gttgctgagt atctccactt 360 
caaggaagag cttgttgatg gaagttcaaa tggaaacttt gttttggaat tggattttga 
gcccttcaac tcatcattcc cccgcccaac tctttcaaaa tccgttggta atggtgtgga 
gttcctaaat cgtcaccttt cggcaaaatt gttccatgac aaggagagca tgcacccttt 54 0 



420 
480 



600 
660 



gctcgaattc ctcagagtcc attgccacaa gggcaagaac atgatgttga atgacagaat 

tcagaacttg aatgctcttc aacatgtttt gaggaaagca gaggagtatc ttggtaccct 

acctcctgag acaccatgtg ccggattcga acaccggttc caggaaatcg gtttggaaag 720 

aggttggggt gacaccgcac aacgcgtgct cgagatgatc caactccttt tggatcttct 780 



Copy provided by USPTO from the IFW Image Database on 04/19/2005 



tgaggcacct 


gatccttgca 


cccttgagaa 


gttccttggg 


agaatcccca 


tggtgttcaa 


840 


tgttgtgatt 


ctcactcccc 


acggatactt 


cgctcaagac 


aatgttttgg 


ggtatcccga 


900 


caccggtggc 


caggttgttt 


acatcttgga 


tcaagtccga 


gctttggaga 


atgagatget 


960 


cctccgtata 


aagcaacaag 


gactcaacat 


cacccctcga 


atcctcatta 


ttactagact 


1020 


tcttcctgat 


gctgtcggaa 


caacatgcgg 


tcaacgactt 


gagaaagtat 


aeggaacaga 


1080 


gtactcggat 


attcttcgag 


tacccttcag 


aacagaaaag 


ggaattgttc 


gtaaatggat 


1140 


ctcaagattt 


gaagtctggc 


catacttgga 


aacctacaca 


gaggatgttg 


ctcatgaaat 


1200 


ctccaaagag 


ttgcaaggca 


agccagatct 


gatcategga 


aactacagtg 


atggcaatat 


1260 


cgtcgcctcc 


ttgctcgcac 


ataaattggg 


tgtcacacag 


tgcaccatcg 


cccatgcttt 


1320 


ggagaagaca 


aaatatcctg 


attcagatat 


ctactggaag 


aagcttgaag 


acaaatacca 


1380 


tttctcttgc 


caatttacag 


ctgatctttt 


tgcaatgaac 


catacagatt 


tcatcatcac 


1440 


cagtactttc 


caggaaattg 


caggaagcaa 


ggacactgtt 


ggtcaatacg 


agagccacac 


1500 


tgctttcact 


cttcctggtc 


tctaccgtgt 


tgtacatggt 


atcgatgtgt 


ttgatcccaa 


1560 


attcaacatt 


gtttcccctg 


gtgctgatat 


ggagatatac 


ttcccttaca 


ccgaagagaa 


1620 


gcggaggttg 


aagcatttcc 


atactgagat 


cgaagacctt 


ctttacagca 


aagttgagaa 


1680 


tgaagaacac 


ttatgtgtgc 


tcaatgaccg 


caacaagcca 


attctgttca 


caatggcaag 


1740 


gcttgatcgt 


gtcaagaact 


taaccggact 


cgtcgagtgg- 


tacggcaaga 


aegcaaagtt 


1800 


gcgtgagttg 


gctaacctcg 


tagttgtagg 


tggtgatagg 


cgaaaggaat 


ct-aaagattt 


I8 60 


ggaagagaag 


gccgaaatga 


agaaaatgtt 


tgagctgatc 


gagaagtaca 


acttgaaegg 


1920 


ccaattcaga 


tggatatcat 


ctcaaatgaa 


cagaatccga 


aatggtgaac 


tttaccgata 


1980 


catttgcgac 


acgaaaggtg 


cctttgtaca 


gectgeattg 


tatgaagect 


ttggattgac 


2040 


agttgtggag 


gcaatgactt 


gcggtttgcc 


aacattcgea 


acctgcaacg 


gtggaccagc 


2100 


cgagattatt 


gtccatggga 


aatctggttt 


caacattgat 


ccttaccatg 


gtgatcaagc 


2160 


tgctgacata 


ctggtcgatt 


tctttgaaaa 


gtgtaagaaa 


gatccatctc 


actgggataa 


2220 






cl a y a ^ s_» y ci 


a a a era a crt* a t 


acatggaaga 


tttactcgga 


2280 


gagactattg 


accctgaccg 


gagtgtatgg 


attct'ggaag 


catgtttcca 


accttgaacg 


2340 


ccgtgagagt 


cgtcgttacc 


ttgagatgtt 


ttatgetett 


aagtaccgea 


agctggctga 


2400 


atcagttcca 


ttggcagagg 


agtaaattta 


agctgttaaa 


taacattggg 


ccggtttttc 


2460 


ttggagaata 


atattctgtt 


ttgtaatttc 


aattggagaa 


gctcttttgt 


atttcatctt 


2520 



Copy provided by USPTO from the 1FW Image Database on 04/19/2005 



gtcttttcct 


tttccttttt 


tcgccggcat 


tgtttgaaca 


tggggttgtg 


cgcccgtcaa 


2580 


ttccagttaa atatggtgac ttttgttttt 
<210> 33 
<211> 2415 
<212> DNA 

<213> Gossypiura hirsutum 


c 

• 






2611 


<400> 33 
atggctaatc 


ctgtgatcac 


• 

tcgcgtccac 


agtctccgtg 


agcgtttaga 


tgagaccctt 


60 


cttgcccaca 


ggaacgagat 


tttggccfctg 


ctctcaagga 


tcgagggcaa 


aggaaaagga 


120 


attctgcaac 


accatcaaat 


tattctagag 


tttgaagcta 


tccctgaaga 


gaacagaaag 


180 


aagctcgctg 


atggtgcatt 


ttttgaagta 


ttgaaggcta 


gtcaggaagc 


gatcgtgttg 


240 


cctccatggg 


ttgcacttgc 


tgttcgtcca 


aggcctggtg 


tttgggagta 


cattagagtg 


300 


aatgttcacg 


cccttgttgt 


tgaggaactt 


actgttgctg 


agtatctcca 


cttcaaggaa 


360 


gagcttgttg 


atggaagttc 


aaatggaaac 


tttgttttgg 


aattggattt 


tgagcccttc 


420 


aactcatcat 


tcccccgccc 


aactctttca 


aaatccgttg 


gtaatggtgt 


ggagttccta 


480 


aatcgtcacc 


tttcggcaaa 


attgttccat 


gacaaggaga 


gcatgcaccc 


tttgctcgaa 


540 


ttcctcagag 


tccattgcca 


caagggcaag 


aacatgatgt 


tgaatgacag 


aattcagaac 


600 


ttgaatgctc 


ttcaacatgt 


tttgaggaaa 


gcagaggagt 


atcttggtac 


cctacctcct 


660 


gagacaccat 


gtgccggatt 


cgaacaccgg 


ttccaggaaa 


tcggtttgga 


aagaggttgg 


720 


ggtgacaccg 


cacaacgcgt 


gctcgagatg 


atccaactcc 


ttttggatct 


tcttgaggca 


780 


cctgatcctt 


gcacccttga 


gaagttcctt 


gggagaatcc 


ccatggtgtt 


caatgttgtg- 


840 


attctcactc 


cccacggata 


cttcgctcaa 


gacaatgttt 


tggggtatcc 


cgacaccggt 


900 


ggccaggttg 


tttacatctt 


ggatcaagtc 


cgagctttgg 


agaatgagat 


gctcctccgt 


960 


ataaagcaac 


aaggactcaa 


catcacccct 


cgaatcctca 


ttattactag 


acttcttcct 


1020 


gatgctgtcg 


gaacaacatg 


cggtcaacga 


cttgagaaag 


tatacggaac 


agagtactcg 


1080 


gatattcttc 


gagtaccctt 


cagaacagaa 


aagggaattg 


ttcgtaaatg 


gatctcaaga 


• 1140 


tttgaagtct 


ggccatactt 


ggaaacctac 


acagaggatg 


ttgctcatga 


aatctccaaa 


1200 


gagttgcaag 


gcaagccaga 


tctgatcatc 


ggaaactaca 


gtgatggcaa 


tatcgtcgcc 


1260 


tccttgctcg 


cacataaatt 


gggtgtcaca 


cagtgcacca 


tcgcccatgc 


tttggagaag 


1320 


acaaaatatc 


ctgattcaga 


tatctactgg 


aagaagcttg 


• 

aagacaaata 


ccatttctct 


1380 


tgccaattta 


cagctgatct 


ttttgcaatg 


aaccatacag 


atttcatcat 


caccagtact 


1440 



Copy provided by USPTO from the IFW Image Database on 04/19/2005 



ttccaggaaa 


ttgcaggaag 


caaggacact 


gttggtcaat 


acgagagcca 


cactgctttc 


1500 


actctfccctg 


gtctctaccg 


tgttgtacat 


ggtatcgatg 


tgtttgatcc 


caaattcaac 


1560 


attgtttccc 


ctggtgctga 


tatggagata 


tacttccctt 


acaccgaaga 


gaagcggagg 


1620 


t t gaagcatt 
cacttatgtg 


tccatactga 
tgctcaatga 


gatcgaagac 
ccgcaacaag 


cttctttaca 
ccaattctgt 


gcaaagttga 
tcacaatggc 


gaatgaagaa 
aaggcttgat 


1680 
1740 


cgtgtcaaga 


acttaaccgg 


actcgtcgag 


tggtacggca 


agaacgcaaa 


gttgcgtgag 


1800 


ttggctaacc 


tcgtagttgt 


aggtggtgat 


aggcgaaagg 


aatctaaaga 


tttggaagag 


1860 


aaggccgaaa 


tgaagaaaat 


gtttgagctg 


atcgagaagt 


acaacttgaa 


cggccaattc 


1920 


agatggatat 


catctcaaat 


gaacagaatc 


cgaaatggtg 


aactttaccg 


atacatttgc 


1980 


gacacgaaag 


gtgcctttgt 


acagcctgca 


ttgtatgaag 


cctttggatt 


gacagttgtg 


2040 


gaggcaatga 


cttgcggttt 


gccaacattc 


gcaacctgca 


acggtggacc 


agcc'gagatt 


2100 


attgtccatg 


ggaaatctgg 


tttcaacatt 


gatccttacc 


atggtgatca 


agctgctgac 


2160 


atactggtcg 


atttctttga 


aaagtgtaag 


aaagatccat 


ctcactggga 


taagatctcc 


2220 


caaggaggct 


tgaaacgtat 


cgaggagaag 


tatacatgga 


agatttactc 


ggagagacta 


2280 


ttgaccctga 


ccggagtgta 


tggattctgg 


aagcatgttt 


ccaaccttga 


acgccgtgag 


2340 


agtcgtcgtt 


accttgagat 


gttttatgct 


cttaagtacc 


gcaagctggc 


tgaatcagtt 


2400 


ccattggcag 


aggag 










2415 



<210> 34 
<211> 732 
<212> DNA 

<213> Gossypium hirsutum 
<400> 34 

agaaagatga tggaaagggg ttttattgtt ttggccttga cggtggtttt cgccgcgacg 60 
gtggttacgg cggctgacga gagtgggtta gcgaatgagt gcagcaaaga tttccagagc 120 
gtgatgactt gcttaagctt tgctcaagga aaagcagcgt cgccgtcgaa ggagtgttgt ' 180 
aattcagtgg cggggattaa agagaataaa cccaaatgtt tgtgttatat tttgcaacaa 24 0 
acacaaactt ccggtgctca aaatctcaaa agcttaggtg ttcaagaaga taagctgttt 300 
cagttaccgt cggcttgtca attgaagaac gctagcgtca gtgattgccc aaagcttctt 360 
gggttatctc cgagctcacc agacgccgcc atcttcacca actcctcctc taaagcaacg 420 
acacccagta cttcaacaac cacpgcaacg ccgtcttccg cggccgataa aaccgatagc 480 



Copy provided by USPTO from the IFW Image Database on 04/19/2005 



aaatccagtg 


gaatcaagct 


tggtccccac 


ttcgtcggtt 


ccacggcggc 


gctactggtt 


540 


gctacagcgg 


ccgtgttttt 


ccttgtattc 


ccagctggat 


ttgcttcaat 


agtttagggg 


600 


ttttgcatgg 


gatttcgaga 


tttggaggtt 


tatttattgt 


tgaagtccat 


ttgtttttaa 


660 


acggtctcag 


aaaaaaaatg 


gactgagttg 


acaattatga 


tgatttttcg 


tttatttttc 


720 


ctttttctta 


tt 










732 


<210> 35 
<211> 585 
<212> DNA 

<213> Gossypium hirsutum 










<400> 35 
atggaaaggg 


gttttattgt 


tttggccttg 


acggtggttt 


tcgccgcgac 


ggtggttacg 


60 


gcggctgacg 


agagtgggtt 


agcgaatgag 


tgcagcaaag 


atttccagag 


cgtgatgact 


120 


tgcttaagct 


ttgctcaagg 


aaaagcagcg 


tcgccgtcga 


aggagtgttg 


taattcagtg 


180 


gcggggatta 


aagagaataa 


acccaaatgt 


ttgtgttata 


ttttgcaaca 


aacacaaact 


240 


tccggtgctc 


aaaatctcaa 


aagcttaggt 


gttcaagaag 


ataagctgtt 


tcagttaccg 


300 


tcggcttgtc 


aattgaagaa 


cgctagcgtc 


agtgattgcc 


caaagcttct 


tgggttatct 


360 


ccgagctcac 


cagacgccgc 


catcttcacc 


aactcctcct 


ctaaagcaac 


gacacccagt 


420 


acttcaacaa 


ccaccgcaac 


gccgtcttcc 


gcggccgata 


aaaccgatag 


caaatccagt 


480 


ggaatcaagc 


ttggtcccca 


cttcgtcggt 


tccacggcgg 


cgctactggt 


tgctacagcg 


540 


gccgtgtttt 


tccttgtatt 


cccagctgga 


tttgcttcaa 


tagtt 




585 


<210> 36 
<211> 610 
<212> DNA 

<213> Gossypium hirsutum 










<400> 36 
caaacactag 


tagaaggttt 


agttttacaa 


acatggctag 


ttccggtgtc 


cttaagttgg 


60 


tttccatgat 


tctcatggtg 


tgcatgacga 


tgatgagtgc 


acccaaggca 


gccaaagccg 


120 


ccatcacgtg 


cagcgacgtg 


gtgaaccact 


t gatcccgtg 


cttgt cct ac 


gt acaaaacg 


i on 

J. O \J 


gcggtacacc 


cgctgctgca 


tgctgcagtg 


gggtaaaagc 


actctacggc 


gaggttcaga 


240 


cctccccgga 


ccgccaaaac 


gtgtgcaagt 


gcatcaaatc 


ggcggtgaac 


ggaattccgt 


300 


acaccagcaa 


taacctcaat 


ctcgcagccg 


gcctacctgc 


taaatgtggt 


ctccaactcc 


360 


cttacagcat 


cagcccctcc 


actgactgca 


acaaggtgca 


gtgaggttga 


tgatgatgat 


420 



Copy provided by USPTO from the 1FW Image Database on 04/19/2005 



atggaaggag tggaagaagg ttccagctca gctagataaa gtagctagct aaggttaaat 480 
aagctgtgtt ggtgtgttgt tttttagaaa attccatata taatcgggga aagaaaaaaa 540 
aatagaaaat gtactttgta actgtatttc gtatgtgata tatataatgt atcgtaatct 
ttaatttttt 

<210> 37 

<211> 369 

<212> DNA 

<213> Gossypium hirsutum 



600 
610 



60 
120 
180 
240 
300 
360 
369 



atggctagtt ccggtgtcct taagttggtt tccatgattc tcatggtgtg catgacgatg 

atgagtgcac ccaaggcagc caaagccgcc atcacgtgca gcgacgtggt gaaccacttg 

atcccgtgct tgtcctacgt acaaaacggc ggtacacccg ctgctgcatg ctgcagtggg 

gtaaaagcac tctacggcga ggttcagacc tccccggacc gccaaaacgt gtgcaagtgc 
atcaaatcgg cggtgaacgg aattccgtac accagcaata acctcaatct cgcagccggc 

ctacctgcta aatgtggtct ccaactccct tacagcatca gcccctccac . tgactgcaac 
aaggtgcag 

<210> 38 
<211> 88.6 
<212> DNA 

<213> Gossypium hirsutum 
<400> 38 

ccacgcgtcc gcccacgcgt ccgggctcat ttgccaaaaa gaacagacaa tgagatcaag 
aactactgga atacacagtt gaagaaaagg ttgacgacga tagggatcga ccctgcaact 
cacaggccta aaaccgatac cctcggttct actcccaagg atgccgctaa ccttagccac 180 
atggctcaat gggagagtgc tcggttagaa gctgaagcta gattggtgag agagtcgaaa 
cgagtttcaa acccttcgca aaaccaattt aggttcacgt cttcatcggc tcctccactg 
gtaagcaaaa ttgatgttgg tttggctcat gctactaaac cgcaatgcct cgatgtactc 
aaagcttggc aacgtgtagt cactggattg ttcactttca acactgacaa cctccaatct 
ccaacatcga cgtcgagctt cacggaaaac acgttaccaa tctcatctgt cgggttcatt 
gacagctttg tggggaactc aaataacagc tgttgcggaa ataattggga atgtgtggag 
aaatcgagcc aagttgctga attacaggaa agattggata actcaatggg gttgcatgac 



60 
120 



240 
300 
360 
420 
480 
540 
600 



Codv orovided bv USPTO from the 1FW Image Database on 04/19/2005 



780 
840 
886 



atattggatc tctcctcaga agatgtatgg tttcaaggct catacagggc ggaaaatatg 660 
atggaagggt attcggacac gttaatggtt tgtgattctg gggatcatcc gaagagtttg 720 
tcaatggagc ctagacaaaa ctttaatgtt ggaacaagta atgctagtag tttcgaagaa 
aacaagaatt actggaacaa catccttaat tttgcgaatg cttccccttc tggttcttct 
gtcttttgag attaattgtt aagatttgaa ataaataaaa atatat 

<210> 39 
<211> 1353 
<212i> DNA 

<213> Gossypium hirsutum 

iJtccctttc cattgtactg atctttcttt ctttagaatt agtcactcct gttctagatg 
aaggttctct ccccaattct tgcttgccta. gcgcttgctg tggtggcaag tcatgctgct 
ctctcacccg agcaatattg gagctataag ctgccaaata ctccaatgcc aaaggctgtc 
aaagaaattc tacatccaga actgatggag gagaaaagta cctctgtaaa tgtaggaggt 
ggtggtgtaa acgtcaacac aggaaaaggg aagccagcgg gtggcactca tgtgaacgtt 
gggcgcaaag .gagttggagt gaacacggga aagccagggg gtggcactca tgtgaatgtt 
ggaggcaaag gagttggggt gaacactgga aagccaggag gtggcaccca tgtgaacgtt 
ggaggcaaag gtggaggagt atctgtacac accggacaca agggaaagcc agtaaatgtt 
aatgtgagtc cgtttcttta ccaatatgca gccagtgaaa ctcaaatcca tgacgatccg 
aatgtggctc ttttctttct ggaaaaggat ttacaccccg ggcaacaatg agcctgcatt 
tcacttgaaa atacagagaa atccctttct taccttatca aactgccaaa aaaatccgtt 
ttcatttacg aagttgccag aattttcaca agttttcagt gaacctggat cagtgaaggc 
agagatgatg aagaacccat taaggagtgc gaacagccag cgattgaagg agaggaaaaa 
tattgtgcac cctcactgga' gtcaatgatt gactacagca tttccaaact agggaaagtt 
gatcaggcag tctcaacaga agtggaaaaa caaaccccaa cgcacaagta tacaataaca 
gctggagtgc agaagatgac aaatgacaaa gctgtagtgt gccacaagca gaattatgca 
tatgctgtct tctattgcca taaatgagaa acaacaaggg cttacatggt tcctttagag 
ggtgctgacg gaacaaaagc caaagcagta gcagtctgtc acacagatac atcagcatgg 
aacccaaagc atttggcttt tcaagtccta aaagttgagc caggaaccat tcctgtctgc, 
catttccttc ctcgggatca cattgtttgg gtccccaagt aaaagtcctg aagagtagac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
.1140 
1200 



•^A u M i ispTO from the IFW Image Database on 04/19/2005 



1353 



tcatacacta tagtttcatc atagggtgca ttaaaacagc ttaaagcaat ctccagtttg 1260 
ttctataata atatacccac gagtttagtc atgtaaaatc tatccatgaa tcatgttctt 1320 
agtaatggat aaaatgatag tactttctgt ate 

<210> 40- 
<211> 1122 
<212> DNA 

<213> Gossypium hirsutum 

atgaaggttc tctccccaat tettgettge etagegcttg ctgtggtggc aagtcatget 
gctctctcac ccgagcaata ttggagctat aagctgccaa atactccaat gecaaagget 
gtcaaagaaa ttctacatcc agaactgatg gaggagaaaa gtacctctgt aaatgtagga 
ggtggtggtg taaacgtcaa cacaggaaaa gggaagccag cgggtggcac tcatgtgaac 
gttgggcgca aaggagttgg agtgaacacg ggaaagccag ggggtggcac tcatgtgaat 
gttggaggca aaggagttgg ggtgaacact ggaaagccag gaggtggcac ccatgtgaac 
gttggaggca aaggtggagg agtatctgta cacaccggac acaagggaaa gecagtaaat 
gttaatgtga gtccgtttct ttaccaatat geagecagtg aaactcaaat ccatgacgat 
ccgaatgtgg ctcttttctt tctggaaaag gatttacacc ccgggcaaca atgagcctgc 
atttcacttg aaaatacaga gaaatccctt- tcttacctta tcaaactgcc aaaaaaatcc 
gttttcattt acgaagttgc cagaattttc acaagttttc agtgaacctg gatcagtgaa 
ggcagagatg atgaagaacc cattaaggag tgegaacage cagegattga aggagaggaa 
aaatattgtg caccctcact ggagtcaatg attgactaca gcatttccaa actagggaaa 
gttgatcagg cagtctcaac agaagtggaa aaacaaaccc caacgcacaa gtatacaata 
acagctggag tgcagaagat gacaaatgac aaagctgtag tgtgccacaa gcagaattat 
geatatgetg tcttctattg ccataaatga gaaacaacaa gggcttacat ggttccttta 
gagggtgctg aeggaacaaa agecaaagea gtagcagtct gtcacacaga tacatcagca 
tggaacccaa agcatttggc ttttcaagtc ctaaaagttg agecaggaac cattcctgtc 
tgccatttcc ttcctcggga tcacattgtt tgggtcccca ag 

<210> 41 

<211> 1373 

<212> DNA 

*<213> Gossypium hirsutum 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1122 



nroviHed bv USPTO from the IFW Image Database on 04/19/2005 



<220> 

<221> misc^f eature* 

<222> (895) . . (895) 

<223> n » unknown 



<220> 

<221> misc_feature 

<222> (911) . . (911) 

<223> n = unknown 



<220> 

<221> misc_f eature 

<222> (1270) . . (1270) 

<223> n = unknown 



<220> 

<221> misc_f eature 

<222> (1336) . . (1336) 

<223> n = unknown 



<400> 41 
tgctctcaga 


atcaaaggaa 


atggtttttc 


aattcaattt 


tccagttctt 


ctattatgtc 


60 


ttatgttttt 


aatgtgtggc 


agaggcaatg 


cagtaaggga 


tttggaaggg 


aaacatgatt 


120 


ttgaaagcca 


tggcagagac 


gacgaagtgg 


agagtttaga 


tgacaagtac 


gttagcgctt 


180 


actttcatca 


aacttttgat 


tctgcaaatc 


actttgatgg 


aggtgatgaa 


gtgaagaatt 


240 


tagaagacaa 


atattcaacg 


gcttacttcc 


acaaatcgtt 


agattctgga 


aaccatggca 


300 


gagatgacaa 


agcaaagata 


ttggaagaca 


agtatgctac 


tgcgtacttc 


cacaagactt 


360 


ctgtttttga 


aaaccatggt 


gaaggtgaca 


aattaaagag 


tttggaagat 


aaatattccg 


420 


cggcttactt 


tcacaacaca 


caatcttcca 


aaatgatgaa 


ggatcacaac 


atggaacatc 


480 


accaccatta 


ccataaccat 


gttgaaagtg 


cagagatagg 


cttgtitcacc 


attgatgaac 


540 


tacatacctt 


taacgtaggg 


aagaaattac 


ccatcttttt 


cccaataaaa 


aaccactctc 


600 


tttaccctcc 


tttattgcct 


aaacaaattg 


ctgacaccat 


ccctttttca 


tctttccaag 


660 


tttctaatat 


tctacgattc 


ttctcagttt 


ctccggactc 


ccccaaaggc 


aaaagctgtt 


720 


caagatacct 


tcgcaaaatg 


cgaactcgga 


gcagcgcaag 


ggggagaccc 


aaaatctggg 


780 


ctacctcttt 


aaaatcttta 


catgggtttc 


taagcatgca 


tttgggcccc 


atgttgattt 


840 


caagttcata 


agccaaggca 


tccccccata 


ccaaccccac 


tctttcaaag 


ttacncagtt 


900 


ttagaatccc 


ntgaagagat 


tgaatctcca 


aagaaagtag 


catgtcatcc 


aatgccatat 


960 



fww nrovided bv USPTO from the IFW Image Database on 04/19/2005 



.•,4-4-4- =j4-rTr->=>rT 


tttatttctq 


tcactttgat 


gccactgaga 


ttaaagcttt 


caaactccgt 


1020 


4- 4- -j +• rrrrt" o 


atattacqqq 


agataaggtg 


gatgctgttg 


ttctttgcca 


tatggatact 


1080 


4- *— ■ — x y-rfv4— 4— /~tf*t Sk 

tcaggtiggd 


nctctaatca 


tgtcgctttt 


cgcatgcttg 


gtattaagca 


aggaaacact 


1140 


gtttgccatg 


tattttctca 


aggtaatctt 


gtttggatta 


atcagccatc 


ggatatcgct 


1200 


gccggtgcca 


tataagtgtt 


gaactgttcg 


atgtagcact 


catttgccac 


tacgtatcga 


1260 


gaccttatcn 


caatataagt 


atttaagagc 


tagtcttatg 


ttcactaggt 


ttcatggtgt 


1320 


ttcgttaatg 


gtgtgncttt 


ctatctatat 


taagtatcaa 


gtaattaagc 


aat 


1373 



<210> 42 

<211> 1212 

<212> DNA 

<213> Gossypium hirsutum 
<220> 

<221> misc_feature 

<222> (895) . . (895) 

<2 23> n = unknown 



<220> 

<2 21> miscjeature 

<222> (911) . . (911) 

<223> h = unknown 



<400> 42 
tgctctcaga 


atcaaaggaa 


atggtttttc 


aattcaattt 


tccagttctt 


ctattatgtc 


60 


ttatgttttt 


aatgtgtggc 


agaggcaatg 


cagtaaggga 


tttggaaggg 


aaacatgatt 


120 


ttgaaagcca 


tggcagagac 


gacgaagtgg 


agagtttaga 


tgacaagtac 


gttagcgctt 


180 


actttcatca 


aacttttgat 


tctgcaaatc 


actttgatgg 


aggtgatgaa 


gtgaagaatt 


240 


tagaagacaa 


atattcaacg 


gcttacttcc 


acaaatcgtt 


agattctgga 


aaccatggca 


300 


gagatgacaa 


agcaaagata 


ttggaagaca 


agtatgctac 


tgcgtacttc 


cacaagactt 


360 


ctgtttttga 


aaaccatggt 


gaaggtgaca 


aattaaagag 


tttggaagat 


aaatattccg 


420 


cggcttactt 


tcacaacaca 


caatcttcca 


aaatgatgaa 


ggatcacaac 


atggaacatc 


480 


accaccatta 


ccataaccat 


gttgaaagtg 


cagagatagg 
> 


cttgttcacc 


attgatgaac 


540 


tacatacctt 


taacgtaggg 


aagaaattac 


ccatcttttt 


cccaataaaa 


aaccactctc 


600 


tttaccctcc 


tttattgcct 


aaacaaattg 


ctgacaccat 


ccctttttca 


* tctttccaag 


660 


tttctaatat 


tctacgattc 


ttctcagttt 


ctccggactc 


ccccaaaggc 


aaaagctgtt 


720 


caagatacct 


tcgcaaaatg 


cgaactcgga 


gcagcgcaag 


ggggagaccc 


aaaatctggg 


780 



>vwf ^rnuirfftd bv USPTO from the IFW Image Database on 04/19/2005 



ctacctcttt 

caagttcata 

ttagaatccc 

ctttatgcag 

ttagttggtg 

tcaggttgga 

gtttgccatg 

gccggtgcca 



aaaatcttta 

agccaaggca 

ntgaagagat 

tttatttctg 

atgttacggg 

gctctgatca 

tattttctca 

ta 



catgggtttc 
tccccccata 
tgaatctcca 
tcactttgat 
agataaggtg 
tgtcgctttt 
aggtaatctt 



taagcatgca 
ccaaccccac 
aagaaagtag 
gccactgaga 
gatgctgttg 
cgcatgcttg 
gtttggatta 



tttgggcccc 
tctttcaaag 
catgtcatcc 
ttaaagcttt 
ttctttgcca 
gtattaagca 
atcagccatc 



atgttgattt 
ttacncagtt 
aatgccatat 
caaactccgt 
tatggatact 
aggaaacact 
ggatatcgct 



<210> 43 
<211> 1024 
<212> DNA 

<213> Gossypium hirsutum 
<400> 43 

gtataacaga ggcagaatcg accggcataa 

tgcatggtgg tgctggtgtt cacccaaatc 

tactcctcaa tcgttgcctt gatattggaa 

ttgactgaca tgaacttgta ttgagggaat 

tggatcggca cttacggata aagggacgga 

gaacagacca tgcggtgctg tttcgggtaa 
cgacttgaaa tggataaaac accacattca 

• gcgaggaaac agggtgtgga gttggtggac 

gggatgctta agttaacaaa agaagcacac 

ctcaccacct gcggaggcag cgcagacatg 

atctttctct acatcatata aacagtgggt 

tctgccgcta cttgcaccgg tggattaatg 

ccgctgattg gttcagagac ttatgcttgt 

gtgaagccat tatgctaagc actttggcta 

gttgaatctt cctgaagctg tggatatgtg 

gtctattgcc tgtcctatat gggaagtgct 

tggctactga agatggatta tggaagttgt 



aaataaaaat 

tccctagtga 

tctgtgcttt 

ttgaacggat 

ggaaatggaa 

acgacatgga 
tctttgggtt 



gggaggctgg 
aaggcaagag 
ttgttctaac 
cctttgctta 
acttgcttta 



gcaatcgcag 
gaagctgtgc 
ctttccggca 
attccgggcg 
tggatggacc 



agaatccgat 
ttgccggcgc 



aatgaatatt 

tcaatcctgt 

gaaaatcgat 

cgagtcgcac 

aacattatga 

gacttattgg 

cggaagtagc 

at.taaactag 

tggggctgaa 

tgtctgcaaa 



tcattacaga 
actattaccg 
tacgaatgaa 
catacaaaca 
ccggaaagat 
ctgtttatgt 
agcgtgatgg 
actatgtgaa 
tactactggt 
ttgatgttag 



atctcttgct 
cgattatttt 

atacaatgtg 

tatcctaacc 

ctggttacca 

atgtcattgc 

tggtgactcg 

accggtgaat 

aatataaatg 

ggcaaagctg 

atgatatggt 

cttagatgct 



840 
900 
960 
1020 
1080 
1140 
1200 
1212 



60 

120 

180 

240 

300 

360 
" 420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 



ggtc 



<210> 44 

<211> 795 

<212> DNA 

<213> Gossypium hirsutum 



1024 



60 



<400> 44 

gtataacaga ggcagaatcg accggcataa aaataaaaat gggaggctgg gcaatcgcag 
tgcatggtgg "tgctggtgtt cacccaaatc tccctagtga aaggcaagag gaagctgtgc 120 
tactcctcaa tcgttgcctt gatattggaa tctgtgcttt ttgttctaac ctttccggca 180 
ttgactgaca tgaacttgta ttgagggaat ttgaacggat cctttgctta attccgggcg 240 
tggatcggca cttacggata aagggacg.ga ggaaatggaa acttgcttta tggatggacc 300 
gaacagacca tgcggtgctg tttcgggtaa acgacatgga agaatccgat atctcttgct 360 
cgacttgaaa tggataaaac accacattca tctttgggtt ttgccggcgc cgattatttt 420 
gcgaggaaac agggtgtgga gttggtggac aatgaatatt tcattacaga atacaatgtg 
gggatgctta agttaacaaa agaagcacac tcaatcctgt actattaccg tatcctaacc 
ctcaccacct gcggaggcag cgcagacatg gaaaatcgat tacgaatgaa ctggttacca 
atctttctct acatcatata aacagtgggt cgagtcgcac catacaaaca atgtcattgc • 
tctgccgcta cttgcaccgg tggattaatg aacattatga ccggaaagat tggtgactcg 
ccgctgattg gttcagagac ttatgcttgt gacttattgg ctgtttatgt accggtgaat 780 

7 95 

gtgaagccat tatgc 

<210> 45 

<211> 989 

<212> DNA 

<213> Gossypium hirsutum 



480 
540 
600 
660 
720 



<400> 45 

accatacact ccaagacccc aaccattaac cgcacaagaa gaatcggatc ttgaattggc 
acaccaaaga ctgttaaaac tttgccaaaa tgcgcgcagt acaacgttcc tttaaccatt 
gatgccgagg acacgtcgat tcaacccgcc atcgattact tcacgtactc tcggccatca 
tgtatacaaa gatgataacc ccattgtcta cggcacgatg caagcttact tgaaagacgc 24 0 
gagggagcgg ctgtttaaca cggcgaggac ggcggagaag ctggggattc atatggggtt 
taagctggtg agaggcgctt acatgtcgag cgaaaccaag ttggcttctt ccttagggtt 
cgattcgccg gttcacaaca ccattcaaga cacccatgct tgtttcaatg attgtgcttc 



60 
120 
180 



300 
360 
420 



Codv provided by USPTO from the IFW Image Database on 04/19/2005 



600 
660 
720 
780 
840 
900 
960 
989 



gtttatgatt gagaagattg ctgatgggta tggcggactc gttctcgcaa ctcataatct 480 
tgagtcaggg aaattggcag catcgaaagc acgaaattta ggaattgaga aggggaatca 54 0 
aaagcttgaa tttgcacagt tatatggaat gtcggaagcg ctgtcgattg gattgagaaa 
cgcagggttt caagttagca aatacttacc ctatggacca gttgatatgg taatgccata 
ccttttaagg agagccgaag aaaatagagg actcttatca acttcaagcc ttgatagaac 
tctcatggggaaggagttga agagaagatt aaagagcctg caatttgcga agccagagat 
ggcagcttca gcagcaggta gcatgaagat agaaatagga acgccataaa tgaggttttg 
attcatagat ggtttgggat gggcaatttt tgccaacaat gtagaattat gaaaaaaaaa 
taacaatcat tgtaacgttt gggcatttgt cccatgtcaa ttattatttg cattagaaat 
tgaatttttt tctttatttt tgaaaaaaa 

<210>- 46 
<211> 410 
<212> DNA 

<213> Gossypium arboreum 

atcaiggctg ccgtaatgtg caataatgct tgaccaaaga tgatataaaa aaagggaaaa 
gagaagaaaa ggtgttcgtc cgaaaacaaa tttaacgatt aaagaagtca agagcgcacc 
tttcaattca tcctttgcgg tcatggtgtt ttgtaagaag gcaaaatcac caagcctgca 
aggatagtag gttcgggaat tgactttgcc aaagagattt taatattaga tatgttggga 
gaactcccca ttttgtgtag gctaagagtt caatgtagga gtggacttta tactagtcta 
atttcttttc agtttcatgt gttattgttg aagcattagt tattttggac ttattcctcc 
attaacaaac atttgttaat ttctgcttaa ■ aaaaaaaaaa aaaaaaaaaa 

<210> 47 

<211> 665 

<212> DNA 

<213> Gossypium arboreum 
<220> 

<221> misc_feature 

<222> (19).. (19) 

<223> n =* unknown 

<220> 

<221> misc^feature 
<222> (112) - . (112) 



60 
120 
180 
240 
300 
360 
410 



r^+f Kmiwpto from the IFW Imaqe Database on 04/19/2005 



<223> n = unknown 



attacaccct tttcatttnt agatcacatc ataagaagac tgggg.ttgaa aaaccccacc 
tcccatggga gtttcttaag cgatgtgagc gtctcctcct ctgtgtaatc tntgattcaa 
gatccatcca ttatcttccc tctgtattgg ctactgcaac catgatgcac gtcatagacc 
aagttgagct tttcaatccc attgactacc aaaatcagct gctgagtgtt cttaaaatta 
gcaaggaaaa agtaaacgat tgttacaagc tcatccttga tgtatcaaca agaccccagg 
ocoaaggcaa tggtggtgca. tgtaagagga aggtggagga gagggttcct agcagcccta 
gtggagtgat tgatgctgca tttggcagtg atagctcgaa cgattcgtgg ggcacggtgt 
ccttatcgcc tgagcagcag ccacctttta agaagagcag agcccaagag caagtaatgc 
gtttgccatc actcaaccga gtctttgtag acattgttgg cagcccttct taattatatc 
tcccttctct ctctccctcg ctctctccat ctctttcttt gtcccaaaaa gatctatatt 
tattatgctt atgttcactt ttggttcaag -gaatcaaatg ttaagttaaa aaaaaaaaaa 
aaaaa 

<210> 48 

<211> 626 

<212> DNA 

<213> Gossypium hirsutum 
<220> 

<221> misc__f eature 

<222> (581) . . (581) 

<223> n = unknown 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
665 



<400> 48 
cttgtttcta 


tctgtatata 


accaagggaa 


ttagacaccc 


gttcagttga 


aagagttcag 


60 


ctgaacaccc 


caaagatggc 


caaccacacc 


gttacctttc 


tccctaaact 


atccattgaa 


120 


gctattcaga 


cagtgactcc 


gatgaggata 


actgaaccac 


gacagactcg 


acaagtattg 


180 


gcaggggagc 


ttgtaggacc 


cgggattttc 


caaaggtgtt 


tgaacgtggt 


ccagtactac ■ 


240 


atgaaggaga 


aagaagaaga 


ctctggttgg 


ttactggctg 


ggtggatcaa 'ggaaacactt 


300 


gggagagctt 


tacatgagca 


accaatgatt 


tctggtcgtc 


ttcggaaagg 


ggaacgaaac 


360 


gatggagaat 


tggagattgt 


ttccaatgac 


tgcggcatta 


gactcattga 


ggcaaggatt 


420 


cagatgaatc 


tgtcggattt 


tcttgatttg 


aaacaaaggg 


aagatgctga 


agctcagctt 


480 



Cfinv nmviHRd bv USPTO from the IFW Image Database on 04/19/2005 



gttttctgga aagatattga tgagcaaaac ccacagttct ccccactctt ttatgttcag 540 
gttactaatt tccagtgtgg tggatattca attgggatta nctgcagtat tcttctggca 600 
gatcttttgt taatgaaaga attcct 626 

<210> 49 
<211> 644 
<212> DNA 

<213> Gossypium arboreum 
<220> 

<221> misc_f eature. 
<222> (585) . . (585) 
<223> n = unknown 

actgagttaa gagtttcaat tcttctactt attatagtta aatatcatat atggccaagt 
acttgaatgt tgtgcttgtt cttgctctag tagtggttca agctactgca aggaatgtgc 
ctagcgatgc tgctggtctc aatgaccaaa agaacctcct cacatacggt ggcattggcg 
gctactctgg catgggttca aatggcatgc caatgggtgg agttgggagt gttggtggta 
tgactggcct tggtggtaca ggtgggatgg gcgccatggt aggtgttggg tatggaggtg 
ggcctggcgc tggtggtgga aatgaaggtg gtgttggcat tggcaatgcg cctggtgtcg 
tccactttcc ttgaactttg ctggatggtt aaaattttaa agcaactagt ttcttgaact 
ttgctggagg ggtttaaatt ttaaagcaac tagtctaact tacgttaaag agtaatatta 
aagttgctct agagtgtgaa atgttttggg ttatgtgata ggtccatctt tatttttttt 
atgtcgagtt ttcttttgtt ttgtaatcct tcattgtcgt ggttntgtag ccgacttaaa 
gtaaataaat tgattttgac aagttaaaaa aaaaaaaaaa acaa 

<210> 50 
<211> 677 
<212> DNA 

<213> Gossypium arboreum 

gac^ctcaaa tataagtagc aaactaacct atgggttatt tggctgattt tgaagggttc 

atggtgtatt ttggtgcgtg tctgttgaga atccgagttg ttgtcccgtg gtattagctt 
ctStgtcttg ctggttgcga ttgggcagtt gtgaggtcta taatcaagtg attcaaggaa 

accgttagct tcattttact tggagaagac aaagaagcta ttgttgtgct ggacttgttc 24 0 

ttgctttttc tctttgtatg gtgtggttta tggtttgtat tatgagtttt atatgaatag 300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 

644 



60 

120 
180 



Cnmr provided bv USPTO from the IFW Image Database on 04/19/2U05 



aactttgaat 


ttggtgagaa 


aattaagaat 


gagcttggga 


ggagcagaag 


tgttgatggc 


360 


aatagcaggg 


ttgtgggcag 


tggttttgag 


gccattgatg 


ataaggtatg 


ccgtagagat 


420 


gagtcaaatg 


autygaatLL 




agttttcagt 


aatcctcttt 


ccccttccgt 


4 80 


atcgtttttt 


tattggtact 


gatatagaaa 


ttctatgaaa 


tgagcacaat 


atgagacacc 


540 


attttttgct 


agccaagaag 


ttagatgagt 


ggtagacttt 


ggtttaagct 


tatcataatt 


600 


gaaattgtta 


gactgtaacc 


cttttgtctc 


ctttctctaa 


tttcaaatcc 


aaattcccat 


660 


caataaaaaa 


aaaaaaa 










677 



<400> 52 

aagatgatga aaaggggttt tattgttttg gccttgatgg tggttttcgc cgcgacggtg 

gttacggggg ctgacgagag tgggttagcg aatgagtgca gcaaagattt ccagagcgtg 
atgacttgct taagctttgc tcaaggaaaa gcagcgtcgc cgtcgaagga gtgttgtaat 



60 



<210> 51 
<211> 692 
<212> ■ DNA 

<213> Gossypium arboreum 

<400> 51 4. 
ccctacattt ttacgctctg gcacagaaga agaaagccct acctatataa tattacatgc 

aaatataatg gtatcattag acgttatgac atcgtataat gtaggaggca tctgctacta 120 

acatttggca gatgaaatta tttacgaaga acaatgggat attttctgta tttgtttatc 180 

atatctggtt acttcaaagc actggttgca acaacagaaa tcaggtttct gctcttcatt 240 

gccattgcta gggaggacca taaacaaacc attcttggga gatgggaaac cctcttgcca 300 

ttaatgccag aacagtttgc agatattaga agcacttgaa aaataaagct gatacagata 

attccatatg taattctact ataatctctt tctctttgtt tctgtaatca aattccagta 

agagcattac tatagtactc atgatttgtt gattcttcta gtgaattgga gagtttagac 

cctcttgaga agacagtgaa tgtagaactt ggtgcctctt ggatgggaag gctgatgttt 54 0 

tcctaaaaga aggtcgtttt ttcatgtgct gcaatgttcg ccatgtataa agagttcgaa 

acaatggtga gactctctcc ttgaacttga gaatgatgaa gaaaagtaat ctgtaagcta 

tcaaaatcat tataacagct gctaagtccc ac 

<210> 52 

<211> 788 

<212> DNA 

<213> Gossypium arboreum 



360 
420 
480 



600 
660 
692 



60 

120 
180 



Codv orovided by USPTO from the IFW image Database on 04/19/2005 



tcagtggcgg ggattaaaga gaataaaccc aaatgtttgt gttatatttt gcaacaaaca 24 0 
caaacttccg gtgctcaaaa tctcaaaagc ttaggtgttc aagaagataa gctgtttcag 300 
ttaccgtcgg cttgtcaatt gaagaacgct agcgtcagtg attgcccaaa gcttcttggg 360 
ttatctccga gctcaccaga cgccgccatc ttcaccaact cctcctctaa agcaacgaca 420 
cccagtactt caacaaccac cgcaacgccg tcttccgcgg ccgataaaac cgatagcaaa 480 
tccagtggaa tcaagcttgg tccccacttc gtcggttcca cggcggcgct actggttgct 540 
acagcggccg tgtttttcct tgtattccca gctggatttg cttcaatagt ttaggggttc 
tgcatgggat ttcgagattt ggaggtttat ttattgttga agtccatttg tttttaaacg 
gtctcagaaa aaaaatggac tgagttgaca attatgatga tttttcgctt attcttgctt 
tttcttattt gattaaacgt ccctttgaaa taaaacttag tttattttcc cagctttccc 
cctgggaa 



600 
660 
720 
780 
788 



<210> 53 
<211> 634 
<212> DNA 

<213> Gossypium arboreum 
<400> 53 

caaacactag tagaaggttt agttttacaa acatggctag ttccggtgtc cttaagttgg 60 
tttccatgat tctcatcgtg tgcatgacgg tgatgagtgc acccaaggca gccaaagccg 120 
ccatcacgtg cagcgacgtg gtgaaccact tgatcccgtg cttgtcctac gtacaaaacg 180 
gcggtacacc cgctgctgca tgctgcagtg gggtaaaagc actctacggc gaggctcaga 240 
cctccccgga ccgccaaaac gtgtgcaagt gcatcaaatc ggcggtgaac ggaattccgt 300 
acaccagcaa taacctcaat ctcgcagccg gcctacctgc taaatgtggt ctccaactcc 360 
cttacagcat cagcccctcc actgactgca acaaggtgca gtgaggttga tgatgatgat 420 
atggaagaag gagtggaaga aggttccagc tcagctagat aaagtagcta gctaaggtta 480 
aataagctgt gttggtgtgt tgttttttag aaaattccat atataatcgg ggaaagaaaa 540 
aaaaaataga aaatgtactt tgtaactgta tttcgtatgt gatatatata atgtatcgta 
atctttaatt ttttaaaaaa aaaaaaaaaa aaaa 

<210> 54 

<211> 884 

<212> DNA 

<213> Gossypium arboreum 



600 
634 



Copy provided by USPTO from the IFW Image Database on 04/19/2005 



<400> 54 

cagtgaaact caaatccatg aagacccgaa tgtggctctt ttctttctgg aaaaggatat 60 

gcaccccggg gcaacaatga gcctacattt cactgaaaat acagagaaat cagctttctt 120 

accttatcaa actgcccaaa aaataccgtt ttcatctgac aagttgccag aaattttcaa 180 

caagttttca gtgaaacctg gatcactgaa ggcagagatg atgaagaaca caattaagga 24 0 

gtgcgaacag ccagcgattg aaggagagga aaaatattgt gcaacctcac tggagtcaat 300 

gattgactat agcatttcca aactagggaa agttgatcag gcagtctcaa cagaagtgga 360 

aaaacaaacc ccaacgcagc agtatacaat aacagctgga gtgcagaaga tgacaaatgg 4 20 

caaagctgta gtgtgccaca agcagaatta tgcatatgct- gtcttctatt gtcataaatc 480 

agaaacaaca agggcttaca tggttccttt agagggtgct gacggaacaa aagccaaagc ■ 54 0 

agtagcagtc tgccacacag atacatcagc atggaaccca aagcatttgg cttttcaagt 

cctaaaagtt gagccaggaa ccattcctgt ' ctgccatttc cttcctcggg atcacattgt 

ttgggtccct aagtaaaagt cctgaagagt agattcatac actatagttt cttcacagtg 720 

tgcattaaaa cagcttaaag caatatccag tttgttctat aataatatac ccacaagttt 780 

agtcatgtaa aatctatcca tgaatcatgt tcttagtaat ggataaaatg atattacttt 8 40 

ctgtatcaca agggtttggt gataaatgta ttagtatttt aagt 8 84 

<210> 55 
<211> 690 
<212> DNA 

<213> Gossypium arboreum 



600 
660 



<400> 55 
ggagtgctct 


cagaatcaaa 


ggaaatggtt 


tttcaattca 


attttccagt 


tcttctatta 


60 


tgtcttatgt 


ttttaa-tgtg 


tggcagaggc 


aatgcagtaa 


gggatttgga 


agggaaacat 


120 - 


gattttgaaa 


gccatggcag 


agacgacgaa 


gtggagagtt 


tagatgacaa 


gtacgttagc 


180 


gcttactttc 


atcaaacttt 


tgattctgca 


aatcactttg 


atggaggtga 


tgaagtgaag 


240 


aatttagaag 


acaaatattc 


aacggcttac 


ttccacaaat 


cgttagattic 


tggaaaccat 


300 


ggaagagatg 


acaaagcaaa 


gatattggaa 


gacaagtatg 


ctactgcgta 


cttccacaag 


360 


acttctgttt 


ttgaaaacca 


tggtgaaggt 


gacaaattaa 


agagtttgga 


agataaatat 


420 


tccgcggctt 


actttcacaa 


cacacaatct 


tccaaaatga 


tgaaggatca 


caacatggaa 


480 


catcaccacc 


attaccataa 


ccatgttgaa 


agtgcagaga 


taggcttgtt 


caccattgat 


540 


gaactacata 


cctttaacgt 


agggaagaaa 


ttacccatct 


ttttcccaat 


aaaaaaccac 


600 



Copy provided by USPTO from the IFW Image Database on 04/19/2005 



tctctttacc 


ctcctttatt 


gcctaaacaa 


attgctgaca 


ccatcccttt 


ttcat cttcc 


660 


caagtttcta 


atattctacg 


attcttctca 








690 


<210> 56 
<211> 653 
<212> DNA 

<213> Gossypium arboreum 








. 


<400> 56 
ggcacgagca 


gaatgaccgg 


cagagaataa 


aatgggaggc 


tgggcaatcg 


cagtgcatgg 


60 


tggtgctggt 


gtagacccaa 


atctccctaa 


tgaaaggcaa 


gaggaagcta 


aaagactcct 


120 


cactcgttgc 


cttgatattg 


gaatctctgc 


tcttcgttct 


aacctctccg 


ccattgacgt 


180 


cgttgaactt 


gtcgtgaggg 


aattggaaac 


ggatcctttg 


tttaattccg 


ggcgtggatc 


240 


agcacttacg 


gagaaaggga 


cggtggaaat 


ggaagctagt 


attatggatg 


gaccgaagag 


300 


acgatgcggt 


gctgtttcgg 


gtttaacgac 


ggtgaagaat 


ccgatatctc 


ttgctcgact 


360 


tgttatggat 


aaaacaccac 


attcgtattt 


gggttttgcc 


ggcgccgaag 


agtttgcgag 


420 


gaaacagggt 


gtggagttgg 


tggacaatga 


atatttcatt 


acagaagaca 


atgtggggat 


480 


gcttaagtta 


gcaaaagaag 


caaactcaat 


cctgttcgat 


taccgtatcc 


caaccctcac 


540 


cacctgcggt 


ggcggcgcag 


ccatggaaaa 


tcaattacaa 


atgaacggct 


taccaatcag 


600 


tctctacgcc 


ccagaaacag 


taggctgcgt 


tgtagttgac 


aaacaggtca 


ttg 


653 


<210> 57 
<211> 612 
<212> DNA 

<213> Gossypium arboreum 










<400> 57 
tttttttttt 


tttttcataa 


ttctacattg 


ttggcaaaaa 


ttgcccatcc 


caaaccatct 


. 60 


atgaatcaaa 


acctcactta 


tggggtgcct 


atttctatct 


tcatgctacc 


tgctgctgta 


120 


gctgccatct 


ctggcttcgc 


aaattgcagg 


ctctttaatc 


ttctcttcaa 


ctccttcccc 


180 


atgagaggtc tatcaaggct 


tgaagttgat 


aagagtcctc 


tattttcttc 


ggctctcctt 


240 


aaaaggtatg 


gcattaccat 


atcaactggt 


Ct-d L-dyyy u. ct 


rr"t~ a +- +" +■ rrot* 

ciy L Cl l— U v_» l_ 


daL- L L y aa 


300 


cctgcgtttc 


tcaatccaaa 


cgacagcgct 


tccgacattc 


catataactg 


tgcaaattca 


360 


agcttttgat 


tccccttctc 


aattcctaaa 


tttcgtgctt 


tcgatgctgc 


caatttccct 


420 


gactcaagat 


tatgagttgc 


cagaatgagt 


ccgccatacc 


catcagcaat 


cttctcaatc 


480 


ataaacgaag 


cacaatcatt 


gaaacaagca 


tgggtgtctt 


gaatggtgtt 


gtgaaccggc 


540 



Copy provided by USPTO from the IFW Image Database on 04/19/2005 



gaatcgaacc ctaaggaaga agcoaacttg gtttcgctcg acatgtaagc gcctctcacc 
agcttaaacg cc 



600 
612 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 

<400> 
ccc 



58 
3 

RNA 

Artificial Sequence 

Stem loop structure of dsRNA 

58 



<210> 59 

<211> 4 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> ■ Stem loop structure of dsRNA 



<400> 
uucg 



59 



<210> 60 

<211> 5 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Stem loop structure of dsRNA 



<400> 
ccacc 



60 



<210> 
<211> 
<212> 



61 
6 

RNA 



<213> Artificial Sequence 
<220> 

<223> Stem loop structure of dsRNA 

<400> .61 
cuggag 



<210> 62 

<211> 6 
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